以下为卖家选择提供的数据验证报告:
数据描述
Context
I decided to perform a multi-label classification task. It can classify an abstract within multiple keywords. The main goal is to categorize some optimized keywords based on papers abstract to show the paper to the relatable users. That's why I scraped IEEE. I fetched papers abstract and the keywords. Then performed necessary cleaning and tasks.
- The scraper program for data collection is available at Scientific Papers Keywords Categorization git repo.
- The classifier integration is live here.
Dataset Containings
- abstracts: The abstract of the paper.
- ieee_keywords: The given IEEE keywords.
- author_keywords: The keywords given by the author.
File Name: [All the .csv files within scraped_data folder]
- abstracts: Th abstract of the paper.
- final_keywords: It contains the unique keywords from author and ieee keywords.
File Name: paper_details_final_data.csv
Acknowledgements
I used IEEE to scrape the dataset. If you use the data research purpose, kindly add a citation.
Inspiration
This dataset can be used for natural language processing tasks mostly for single-label and multi-label classifications.

IEEE Papers Abstract and Keywords
29.91MB
申请报告