Dataset:
Turkish-English Parallel Corpus

dc.contributor.authorTaşçı, Şerafettin
dc.contributor.authorGüngör, Mustafa
dc.contributor.authorGüngör, Tunga
dc.date.accessioned2023-03-03T22:22:45Z
dc.date.available2023-03-03T22:22:45Z
dc.date.issued2006
dc.descriptionThe corpus is in the form of a text file which includes 229,554 data instances (sentence pairs). Each data instance is formed of a sentence id, Turkish sentence, and English sentence. Example: Sentence id: 148514 Turkish sentence: Belki de vakit gelsin diye bekliyorlar. English sentence: Maybe they've been biding their time
dc.identifier.urihttps://tulap.cmpe.boun.edu.tr/handle/20.500.12913/64
dc.language.isoTurkish
dc.publisherBoğaziçi University
dc.relation.isreferencedbyhttps://www.cmpe.boun.edu.tr/~gungort/papers/Compiling%20a%20Turkish-English%20Bilingual%20Corpus%20and%20Developing%20an%20Algorithm%20for%20Sentence%20Alignment.pdf
dc.rightsApache License 2.0
dc.rights.urihttp://opensource.org/licenses/Apache-2.0
dc.subjectBilingual corpus
dc.subjectMachine translation
dc.subjectSentence aligned
dc.titleTurkish-English Parallel Corpus
dc.typecorpus
dspace.entity.typeDataset
local.contact.personTunga, Güngör, gungort@boun.edu.tr, Boğaziçi University
Files
Original bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
Turkish-English_Parallel_Corpus.7z.001
Size:
10 MB
Format:
Unknown data format
Description:
Unknown
No Thumbnail Available
Name:
Turkish-English_Parallel_Corpus.7z.002
Size:
2.94 MB
Format:
Unknown data format
Description:
Unknown
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.62 KB
Format:
Plain Text
Description:
Collections