雪碧瓜瓜

TED Talk Transcripts (2006 - 2021)

education, social science, text

6

Sold: 0
16.14MB

Data ID: D17171621742190723

Published: 2024/05/31

The following is the data verification report that the seller has chosen to provide:

Data description

Context

This dataset was created as part of a Quantitative Text Analysis project I had to complete during my master's degree. I wanted to explore the content of TED talks (being a big TED fan myself). There were datasets available at the time, but most of them were outdated. I also wanted to try my hand at web scraping! So, this data took shape 😄

Content

The complete details of the data acquisition process are available here: https://deepnote.com/@ramshankar-yadhunath/Scraping-TED-fRqC4ebhTRaNrtcOSrIXMQ. I would highly recommend having a read if you are interested in using web scraping as a data acquisition technique.

Acknowledgements

A big shoutout to the work by @rounakbanik at https://www.kaggle.com/rounakbanik/ted-talks, which was my starting point. Vishal Gupta's https://github.com/The-Gupta/TED-Scraper is also a useful resource.
