Dataset: Turkish Question Answering Dataset (SQuAD-TR).
creativework.keywords | Question Answering | |
creativework.keywords | Open Domain Question Answering | |
creativework.keywords | OpenQA | |
creativework.keywords | Low-Resource Languages | |
creativework.keywords | Machine Translation | |
dc.contributor.author | Budur, Emrah | |
dc.contributor.author | Rıza, Özçelik | |
dc.contributor.author | Soylu, Dilara | |
dc.contributor.author | Khattab, Omar | |
dc.contributor.author | Güngör, Tunga | |
dc.contributor.author | Potts, Christopher | |
dc.date.accessioned | 2023-03-03T22:22:45Z | |
dc.date.available | 2023-03-03T22:22:45Z | |
dc.date.issued | 2024-01-07 | |
dc.description | The corpus is derived from SQuAD2.0 using Amazon Translate. It consists of 61,293 question-answer pairs and 18,776 paragraphs containing answers of the questions. | |
dc.identifier.uri | https://tulap.cmpe.boun.edu.tr/handle/20.500.12913/60 | |
dc.language.iso | Turkish | |
dc.publisher | Boğaziçi University | |
dc.relation.isreferencedby | https://arxiv.org/abs/2401.03590 | |
dc.rights | Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) | |
dc.rights.uri | http://creativecommons.org/licenses/by-sa/4.0/ | |
dc.source.uri | https://github.com/boun-tabi/SQuAD-TR | |
dc.subject | Question Answering | |
dc.subject | Machine Translation | |
dc.title | Turkish Question Answering Dataset (SQuAD-TR). | |
dc.type | corpus | |
dspace.entity.type | Dataset | |
local.contact.person | Emrah, Budur, emrah.budur@yahoo.com, Boğaziçi University |
Files
Original bundle
1 - 4 of 4
No Thumbnail Available
- Name:
- squad-tr-dev-v1.0.0-excluded.json.gz
- Size:
- 375.23 KB
- Format:
- Unknown data format
No Thumbnail Available
- Name:
- squad-tr-dev-v1.0.0.json.gz
- Size:
- 644.86 KB
- Format:
- Unknown data format
No Thumbnail Available
- Name:
- squad-tr-train-v1.0.0-excluded.json.gz
- Size:
- 4.66 MB
- Format:
- Unknown data format
No Thumbnail Available
- Name:
- squad-tr-train-v1.0.0.json.gz
- Size:
- 8.36 MB
- Format:
- Unknown data format
License bundle
1 - 1 of 1