Dataset: Turkish-English Parallel Corpus
dc.contributor.author | Taşçı, Şerafettin | |
dc.contributor.author | Güngör, Mustafa | |
dc.contributor.author | Güngör, Tunga | |
dc.date.accessioned | 2023-03-03T22:22:45Z | |
dc.date.available | 2023-03-03T22:22:45Z | |
dc.date.issued | 2006 | |
dc.description | The corpus is a single text file containing 229,554 data instances (sentence pairs). Each data instance consists of a sentence id, a Turkish sentence, and an English sentence. Example: Sentence id: 148514; Turkish sentence: "Belki de vakit gelsin diye bekliyorlar."; English sentence: "Maybe they've been biding their time" | |
dc.identifier.uri | https://tulap.cmpe.boun.edu.tr/handle/20.500.12913/64 | |
dc.language.iso | Turkish | |
dc.publisher | Boğaziçi University | |
dc.relation.isreferencedby | https://www.cmpe.boun.edu.tr/~gungort/papers/Compiling%20a%20Turkish-English%20Bilingual%20Corpus%20and%20Developing%20an%20Algorithm%20for%20Sentence%20Alignment.pdf | |
dc.rights | Apache License 2.0 | |
dc.rights.uri | http://opensource.org/licenses/Apache-2.0 | |
dc.subject | Bilingual corpus | |
dc.subject | Machine translation | |
dc.subject | Sentence aligned | |
dc.title | Turkish-English Parallel Corpus | |
dc.type | corpus | |
dspace.entity.type | Dataset | |
local.contact.person | Tunga, Güngör, gungort@boun.edu.tr, Boğaziçi University |
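The record above describes each data instance as a sentence id, a Turkish sentence, and an English sentence, but it does not specify the exact file layout. The following is a minimal sketch of reading such records, assuming (and this is only an assumption) that each instance carries the three labels exactly as they appear in the example in dc.description. The names `SentencePair`, `RECORD_RE`, and `parse_pairs` are hypothetical and not part of the dataset.

```python
import re
from typing import Iterator, NamedTuple


class SentencePair(NamedTuple):
    """One data instance: id, Turkish sentence, English sentence."""
    sentence_id: int
    turkish: str
    english: str


# ASSUMPTION: records use the labels shown in the dc.description example
# ("Sentence id:", "Turkish sentence:", "English sentence:").
# The real corpus file may use a different layout (e.g. tab-separated).
RECORD_RE = re.compile(
    r"Sentence id:\s*(\d+)\s*"
    r"Turkish sentence:\s*(.*?)\s*"
    r"English sentence:\s*(.*?)\s*(?=Sentence id:|\Z)",
    re.DOTALL,
)


def parse_pairs(text: str) -> Iterator[SentencePair]:
    """Yield every labeled sentence pair found in `text`."""
    for match in RECORD_RE.finditer(text):
        yield SentencePair(int(match.group(1)), match.group(2), match.group(3))


# The single example instance quoted in the metadata record.
sample = (
    "Sentence id: 148514 "
    "Turkish sentence: Belki de vakit gelsin diye bekliyorlar. "
    "English sentence: Maybe they've been biding their time"
)
pairs = list(parse_pairs(sample))
```

If the downloaded file turns out to use a different delimiter, only `RECORD_RE` would need to change; the `SentencePair` structure mirrors the three fields named in the description.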