Show simple item record

 
dc.contributor.author Marşan, Büşra
dc.contributor.author Türk, Utku
dc.contributor.author Atmaca, Furkan
dc.contributor.author Özateş, Şaziye Betül
dc.contributor.author Berk, Gözde
dc.contributor.author Bedir, Seyyit Talha
dc.contributor.author Köksal, Abdullatif
dc.contributor.author Başaran, Balkız Öztürk
dc.contributor.author Güngör, Tunga
dc.contributor.author Özgür, Arzucan
dc.contributor.author Uskudarli, Susan
dc.contributor.author Akkurt, Salih Furkan
dc.date.accessioned 2022-07-26T15:05:18Z
dc.date.available 2022-07-26T15:05:18Z
dc.date.issued 2022
dc.identifier.uri https://hdl.handle.net/20.500.12913/33
dc.description This dataset is the re-annotated version of BOUN Treebank. Extracted from Turkish National Corpus (TNC), BOUN Treebank consists of 9,761 sentences (121,214 tokens) from five different text types: Biographical texts, national newspapers, instructional texts, popular culture articles, and essays. The syntactic dependency relations and morphological features of the sentences were manually annotated by linguists following the UD scheme. Some statistics on the treebank: - Although the dataset shows word order variance, more than %70 of the sentences have OV and SV word order. - The average token count of the updated treebank is 12.74 and the average arc length is 2.90.
dc.language.iso tur
dc.publisher Boğaziçi University
dc.relation.isreferencedby https://arxiv.org/abs/2207.11782
dc.subject dependency annotation
dc.subject universal dependencies
dc.title BOUN Treebank v2.11
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
hidden false
hasMetadata false
has.files yes
branding Community
contact.person Büşra Marşan busra.marsan@boun.edu.tr Boğaziçi University
sponsor TÜBİTAK 16909 Dilbilim Temelli Türkçe Doğal Dil İşleme Platformu nationalFunds
size.info 9761 sentences
files.size 9842598
files.count 3


 Files in this item

 Download all files in item (9.39 MB)
Icon
Name
tr_boun_v2-dev.conllu
Size
944.41 KB
Format
Unknown
Description
Turkish BOUN Treebank v2, dev file
MD5
1f0a95b514159dd14d003c02f836ae45
 Download file
Icon
Name
tr_boun_v2-test.conllu
Size
933.25 KB
Format
Unknown
Description
Turkish BOUN Treebank v2, test file
MD5
7d71dd9fc08ff4ddb13e2de0449c3bec
 Download file
Icon
Name
tr_boun_v2-train.conllu
Size
7.55 MB
Format
Unknown
Description
Turkish BOUN Treebank v2, train file
MD5
03cb502c226e1eef1f10c953f4a1af52
 Download file

Show simple item record