1.数据集介绍
数据集的格式如下:
id :arXiv ID,可⽤于访问论⽂;
submitter :论⽂提交者;
authors :论⽂作者;
title :论⽂标题;
comments :论⽂⻚数和图表等其他信息;
journal-ref :论⽂发表的期刊的信息;
doi :数字对象标识符,https://www.doi.org;
report-no :报告编号;
categories :论⽂在 arXiv 系统的所属类别或标签;
license :⽂章的许可证;
abstract :论⽂摘要;
versions :论⽂版本;
authors_parsed :作者的信息
2.arxiv论⽂类别
'astro-ph': 'Astrophysics',
'astro-ph.CO': 'Cosmology and Nongalactic Astrophysics',
'astro-ph.EP': 'Earth and Planetary Astrophysics',
'astro-ph.GA': 'Astrophysics of Galaxies',
'cs.AI': 'Artificial Intelligence',
'cs.AR': 'Hardware Architecture',
'cs.CC': 'Computational Complexity',
'cs.CE': 'Computational Engineering, Finance, and Science',
'cs.CV': 'Computer Vision and Pattern Recognition',
'cs.CY': 'Computers and Society',
'cs.DB': 'Databases',
'cs.DC': 'Distributed, Parallel, and Cluster Computing',
'cs.DL': 'Digital Libraries',
'cs.NA': 'Numerical Analysis',
'cs.NE': 'Neural and Evolutionary Computing',
'cs.NI': 'Networking and Internet Architecture',
'cs.OH': 'Other Computer Science',
'cs.OS': 'Operating Systems











网友评论