The following is the data validation report that the seller chose to provide:
Data description
Context
This dataset was created as part of a Quantitative Text Analysis project I completed during my Master's degree. I wanted to explore the content of TED talks (being a big TED fan myself). There were datasets available at the time, but most of them were outdated. I also wanted to try my hand at web scraping! So, this data took shape 😄
Content
The complete details of the data acquisition process are available here: https://deepnote.com/@ramshankar-yadhunath/Scraping-TED-fRqC4ebhTRaNrtcOSrIXMQ. I would highly recommend reading it if you are interested in using web scraping as a data acquisition technique.
Acknowledgements
A big shoutout to @rounakbanik for the work at https://www.kaggle.com/rounakbanik/ted-talks, which was my starting point. Vishal Gupta's https://github.com/The-Gupta/TED-Scraper is also a useful resource.
