Dataset:
Scientific Abstracts Corpus

dc.contributor.authorÖztürk, Seçil
dc.contributor.authorSankur, Bülent
dc.contributor.authorGüngör, Tunga
dc.contributor.authorYılmaz, Mustafa Berkay
dc.contributor.authorKöroglu, Bilge
dc.contributor.authorAğın, Onur
dc.contributor.authorİşbilen, Mustafa
dc.contributor.authorUlaş, Çağdaş
dc.contributor.authorAhat, Mehmet
dc.date.accessioned2023-03-03T22:22:48Z
dc.date.available2023-03-03T22:22:48Z
dc.date.issued2014-04-01
dc.descriptionThe dataset is a labeled text corpus in 35 academic disciplines compiled from journals and conference proceedings. For each discipline, 200 papers were compiled. Each text includes the topic, name of the resource, title of the paper, abstract, and keywords (if available). The corpus consists of 34 xml files where each file corresponds to a discipline. Each xml file contains information about 200 papers. Information for a paper has the following format: <makale> <Etiket>discipline</Etiket> <Başlık>paper title</Başlık> <Özetçe>paper abstract</Özetçe> <Anahtar>keywords separated by commas</Anahtar> <Kaynak>journal/conference name</Kaynak> <TürkçeKarakter>Sorunsuz/Sorunlu</TürkçeKarakter> </makale> Example: <makale> <Etiket>Arkeoloji</Etiket> <Başlık>Burdur Bölgesi Neolitik Çağ Mimarlığı ve Anadolu'daki Çağdaşları Arasındaki Konumu Hakkında</Başlık> <Özetçe> Bu makalede yeni kazılarda elde edilen bilgiler ışığında Burdur yöresinde yaklaşık 2000 yıl (İÖ 7000 - 5300) süren Neolitik Çağ boyunca mimaride gözlenen özellikleri irdeleyeceğiz. ... </Özetçe> <Anahtar></Anahtar> <Kaynak>Adalya - Akdeniz Medeniyetleri Araştırma Enstitüsü Yıllığı</Kaynak> <TürkçeKarakter>Sorunsuz</TürkçeKarakter> </makale>
dc.description.sponsorshipTÜBİTAK, 3120918, TEYDEB, nationalFunds
dc.identifier.urihttps://tulap.cmpe.boun.edu.tr/handle/20.500.12913/78
dc.language.isoTurkish
dc.publisherBoğaziçi University
dc.relation.isreferencedbyhttps://ieeexplore.ieee.org/abstract/document/6830499
dc.rightsApache License 2.0
dc.rights.urihttp://opensource.org/licenses/Apache-2.0
dc.subjectScientific papers
dc.subjectAbstract
dc.subjectText classification
dc.titleScientific Abstracts Corpus
dc.typecorpus
dspace.entity.typeDataset
local.contact.personTunga, Güngör, gungort@boun.edu.tr, Boğaziçi University
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Turkce_Makale_Derlemi.zip
Size:
3.26 MB
Format:
Unknown data format
Description:
Unknown
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.62 KB
Format:
Plain Text
Description:
Collections