Dataset:
Turkish Question Answering Dataset (SQuAD-TR).

creativework.keywordsQuestion Answering
creativework.keywordsOpen Domain Question Answering
creativework.keywordsOpenQA
creativework.keywordsLow-Resource Languages
creativework.keywordsMachine Translation
dc.contributor.authorBudur, Emrah
dc.contributor.authorRıza, Özçelik
dc.contributor.authorSoylu, Dilara
dc.contributor.authorKhattab, Omar
dc.contributor.authorGüngör, Tunga
dc.contributor.authorPotts, Christopher
dc.date.accessioned2023-03-03T22:22:45Z
dc.date.available2023-03-03T22:22:45Z
dc.date.issued2024-01-07
dc.descriptionThe corpus is derived from SQuAD2.0 using Amazon Translate. It consists of 61,293 question-answer pairs and 18,776 paragraphs containing answers of the questions.
dc.identifier.urihttps://tulap.cmpe.boun.edu.tr/handle/20.500.12913/60
dc.language.isoTurkish
dc.publisherBoğaziçi University
dc.relation.isreferencedbyhttps://arxiv.org/abs/2401.03590
dc.rightsCreative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
dc.rights.urihttp://creativecommons.org/licenses/by-sa/4.0/
dc.source.urihttps://github.com/boun-tabi/SQuAD-TR
dc.subjectQuestion Answering
dc.subjectMachine Translation
dc.titleTurkish Question Answering Dataset (SQuAD-TR).
dc.typecorpus
dspace.entity.typeDataset
local.contact.personEmrah, Budur, emrah.budur@yahoo.com, Boğaziçi University
Files
Original bundle
Now showing 1 - 4 of 4
No Thumbnail Available
Name:
squad-tr-dev-v1.0.0-excluded.json.gz
Size:
375.23 KB
Format:
Unknown data format
No Thumbnail Available
Name:
squad-tr-dev-v1.0.0.json.gz
Size:
644.86 KB
Format:
Unknown data format
No Thumbnail Available
Name:
squad-tr-train-v1.0.0-excluded.json.gz
Size:
4.66 MB
Format:
Unknown data format
No Thumbnail Available
Name:
squad-tr-train-v1.0.0.json.gz
Size:
8.36 MB
Format:
Unknown data format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.62 KB
Format:
Plain Text
Description:
Collections