Dataset:
Turkish Multi-document Summarization (MDS) Corpus

dc.contributor.author: Nuzumlalı, Muhammed Yavuz
dc.contributor.author: Özgür, Arzucan
dc.date.accessioned: 2023-03-03T22:22:46Z
dc.date.available: 2023-03-03T22:22:46Z
dc.date.issued: 2014-10-01
dc.description: The corpus contains four folders. The folder “clusters” holds the original documents to be summarized, organized into 21 subfolders, one per topic. Each subfolder contains about 10 documents (multi-documents) on the same topic. The folders “models1”, “models2”, and “models3” hold three different manually prepared summaries of each of these 21 topics. Each summary folder contains 21 text files, and each text file is the multi-document summary of the documents in the corresponding topic.
Example:
The files in the folder clusters/1/:
1.txt: Zonguldak'ta ruhsatsız olduğu ileri sürülen …
…
9.txt: Zonguldak'ta, ruhsatsız kömür ocağında …
The file in the folder models1:
1: Zonguldak'ta yaşanan 2 ayrı maden kazasında …
The file in the folder models2:
1: Zonguldak'ta çalışma ruhsatı olmayan …
The file in the folder models3:
1: Zonguldak'ta ruhsatsız olduğu ortaya çıkan …
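The folder layout described above can be read with a short loader. The following is a minimal sketch, not part of the corpus distribution: the function names `read_text` and `load_corpus` are hypothetical, UTF-8 encoding is an assumption (the record does not state the encoding), and because the record does not say whether the summary files carry a ".txt" extension, both spellings are tried.

```python
from pathlib import Path


def read_text(path, encoding="utf-8"):
    # Assumption: files are UTF-8. Undecodable bytes are replaced, not fatal.
    return Path(path).read_text(encoding=encoding, errors="replace")


def load_corpus(root, encoding="utf-8"):
    """Return {topic: {"documents": {name: text}, "summaries": {model: text}}}."""
    root = Path(root)
    corpus = {}
    topic_dirs = [p for p in (root / "clusters").iterdir() if p.is_dir()]
    for topic_dir in sorted(topic_dirs, key=lambda p: p.name):
        topic = topic_dir.name
        # Source documents of this topic, e.g. clusters/1/1.txt
        documents = {
            doc.name: read_text(doc, encoding)
            for doc in sorted(topic_dir.glob("*.txt"))
        }
        summaries = {}
        for model in ("models1", "models2", "models3"):
            # Summary file for this topic: try "1" first, then "1.txt".
            for candidate in (root / model / topic, root / model / f"{topic}.txt"):
                if candidate.is_file():
                    summaries[model] = read_text(candidate, encoding)
                    break
        corpus[topic] = {"documents": documents, "summaries": summaries}
    return corpus
```

Calling `load_corpus` on the directory obtained by extracting the archive should yield one entry per topic subfolder, each with its source documents and up to three reference summaries. Topics are keyed by folder name as strings, so they sort lexicographically rather than numerically.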
dc.identifier.uri: https://tulap.cmpe.boun.edu.tr/handle/20.500.12913/69
dc.language.iso: Turkish
dc.publisher: Boğaziçi University
dc.relation.isreferencedby: https://aclanthology.org/D14-1077.pdf
dc.rights: Apache License 2.0
dc.rights.uri: http://opensource.org/licenses/Apache-2.0
dc.subject: Multi-document text summarization
dc.title: Turkish Multi-document Summarization (MDS) Corpus
dc.type: corpus
dspace.entity.type: Dataset
local.contact.person: Arzucan, Özgür, arzucan.ozgur@boun.edu.tr, Boğaziçi University
Files
Original bundle
Name: TurkishMDSDataSet.zip
Size: 361.17 KB
Format: Unknown data format
Description: Unknown
License bundle
Name: license.txt
Size: 1.62 KB
Format: Plain Text