以下为卖家选择提供的数据验证报告:
数据描述
Context
This dataset contains around 6k articles related to COVID-19 from 69 french-speaking news websites.
Content
- date_publish: Date of the publication. Some dates are inaccurate.
- title: Headline of the article.
- description: Short description of the article.
- maintext: Main content of the article. Some data are truncated because a subscription is required.
- url: URL of the article.
- labels: Categories of the article.
Acknowledgements
This dataset was collected from lemonde.fr, lefigaro.fr, liberation.fr, leparisien.fr, lesechos.fr, la-croix.com, lequipe.fr, slate.fr, latribune.fr, nouvelobs.com, lexpress.fr, marianne.net, francesoir.fr, leprogres.fr, lejdd.fr, linternaute.com, telerama.fr, bfmtv.com, lci.fr, francetvinfo.fr, boursorama.com, rtl.fr, clubic.com, huffingtonpost.fr, capital.fr, ledauphine.com, parismatch.com, europe1.fr, legorafi.fr, lalibre.be, lesoir.be, closermag.fr, elle.fr, esprit.presse.fr, sciencesetavenir.fr, politis.fr, caminteresse.fr, femmeactuelle.fr, nationalgeographic.fr, voici.fr, regards.fr, larecherche.fr, lhistoire.fr, journalmetro.com, dhnet.be, letemps.ch, levif.be, lesaffaires.com, lactualite.com, rtbf.be, franceinter.fr, lepetitjournal.com, lapresse.ca, futura-sciences.com, science-et-vie.com, pourlascience.fr, demotivateur.fr, buzzbeed.com, nordpresse.be, bopress.ma, secretnews.fr, letelegramme.fr, numerama.com, laprovence.com, ladepeche.fr, midilibre.fr, telestar.fr, courrierinternational.com and melty.fr.
Inspiration
- Study the media impact of COVID-19
- Analyze different writing styles
- Sentiment Analysis
- News generation
Citation
If you're using this dataset for research purposes, please use the following BibTex for citations:
@dataset{covidfrenchnews, author = {Gustave Cortal}, year = {2021}, month = {03}, title = {COVID-19: French news dataset}, url = {https://www.gustavecortal.com} }
