The following is the data validation report that the seller chose to provide:
Data Description
Context
This data was scraped from the Bajaj Finserv website, which hosts many articles, blog posts, and product pages about financial products such as insurance, loans, and EMIs.
Content
The scraped data is organized as follows. All paragraphs and lines of text that were scraped are stored in the file paras-and-lines-website-scraped.csv. All tweets that were scraped are stored in a second CSV file. The URLs of all blog pages on the website are stored in a third CSV file.
Acknowledgements
We would like to thank Bajaj Finserv for providing the data for our research.
Inspiration
Using this large, domain-specific corpus of text on financial products, we are trying to build a recommendation system that returns the web pages most closely related to a keyword search.
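
One possible way to approach such a keyword-to-page recommender is TF-IDF weighting with cosine similarity. The sketch below is only an illustration under stated assumptions, not the method actually used for this dataset: the example.com URLs and page texts in PAGES are invented toy data, and the function names build_index and recommend are hypothetical. In practice the page texts would come from the scraped CSV files, whose column layout is not described here.

```python
import math
import re
from collections import Counter

# Toy stand-in for the scraped corpus: URL -> page text.
# These URLs and texts are invented for illustration only.
PAGES = {
    "https://example.com/home-loan":
        "Home loan interest rates and EMI options for buying a house",
    "https://example.com/health-insurance":
        "Health insurance plans covering hospital expenses",
    "https://example.com/personal-loan":
        "Personal loan eligibility and EMI calculator",
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(pages):
    """Return (idf, vectors): smoothed IDF weights and unit-length TF-IDF vectors."""
    tokenized = {url: tokenize(text) for url, text in pages.items()}
    n_docs = len(tokenized)

    # Document frequency: number of pages containing each term.
    doc_freq = Counter()
    for tokens in tokenized.values():
        doc_freq.update(set(tokens))

    # Smoothed inverse document frequency, so no weight is ever zero.
    idf = {t: math.log((1 + n_docs) / (1 + df)) + 1.0 for t, df in doc_freq.items()}

    vectors = {}
    for url, tokens in tokenized.items():
        weights = {t: c * idf[t] for t, c in Counter(tokens).items()}
        norm = math.sqrt(sum(w * w for w in weights.values())) or 1.0
        vectors[url] = {t: w / norm for t, w in weights.items()}
    return idf, vectors


def recommend(query, idf, vectors, top_k=3):
    """Return up to top_k (url, score) pairs ranked by cosine similarity."""
    # Query terms that never occur in the corpus are ignored.
    counts = Counter(t for t in tokenize(query) if t in idf)
    weights = {t: c * idf[t] for t, c in counts.items()}
    norm = math.sqrt(sum(w * w for w in weights.values()))
    if norm == 0:
        return []

    scored = []
    for url, vec in vectors.items():
        score = sum((w / norm) * vec.get(t, 0.0) for t, w in weights.items())
        if score > 0:
            scored.append((url, score))
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


if __name__ == "__main__":
    idf, vectors = build_index(PAGES)
    for url, score in recommend("home loan", idf, vectors):
        print(f"{score:.3f}  {url}")
```

With this toy data, a search for "home loan" ranks the home loan page first and the personal loan page second, while the health insurance page is omitted because it shares no terms with the query. A production system would likely need stop-word handling, stemming, and a sparse-matrix library to scale to the full corpus.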
