Description
INSEE (the French national institute of statistics) released two datasets reporting the names given to French babies since 1900: one at the national level and one at the departmental level. This dataset lets you track trends in how French babies have been named since 1900, explore the male/female proportion in the population over time, find the most gender-neutral names, and more.
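As one illustration of the gender-neutral-names idea, here is a minimal sketch in plain Python. It assumes each row has `name`, `sex`, and `count` fields and that sex is coded as "M"/"F"; these names and codes are assumptions, so check them against the actual files. The score used here (the smaller sex's share of all births for a name, where 0.5 is a perfect balance) is just one possible definition, and the sample rows are made up.

```python
from collections import defaultdict

def neutrality_scores(rows, min_total=1):
    """Return {name: share of the less common sex}, from 0.0 to 0.5."""
    # Sum births per name and per sex across all years.
    totals = defaultdict(lambda: {"M": 0, "F": 0})
    for row in rows:
        totals[row["name"]][row["sex"]] += row["count"]

    scores = {}
    for name, by_sex in totals.items():
        total = by_sex["M"] + by_sex["F"]
        # Skip names with too few births to give a meaningful ratio.
        if total >= min_total:
            scores[name] = min(by_sex["M"], by_sex["F"]) / total
    return scores

# Made-up sample rows, not taken from the dataset.
rows = [
    {"name": "CAMILLE", "sex": "M", "year": 2000, "count": 400},
    {"name": "CAMILLE", "sex": "F", "year": 2000, "count": 600},
    {"name": "JEAN", "sex": "M", "year": 2000, "count": 1000},
]
scores = neutrality_scores(rows)
most_neutral = max(scores, key=scores.get)
print(most_neutral, scores[most_neutral])
```

Raising `min_total` keeps very rare names from dominating the ranking.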
How do these two datasets differ from the original ones released by INSEE?
I took the raw files from INSEE and did some light shaping and cleaning:
- Rows with missing values in the `year` column are removed.
- Unusual names (fewer than 20 babies born in the same year) are dropped.
- Column names are translated from French to English.
- Department names are added to the department-level dataset (column `dpt_name`).
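The cleaning steps listed above can be sketched roughly as follows. This is not the script actually used to build the dataset, only an illustration in plain Python. The raw French column names in `COLUMN_MAP`, the English names they map to (other than `year` and `dpt_name`, which the description mentions), the `XXXX` placeholder for a missing year, and the sample rows are all assumptions. The sketch also assumes the 20-baby threshold applies to each row's count.

```python
# Assumed raw column names -> English names (hypothetical mapping).
COLUMN_MAP = {"sexe": "sex", "preusuel": "name", "annais": "year",
              "dpt": "dpt", "nombre": "count"}

def clean_rows(raw_rows, dpt_names=None, min_count=20):
    """Apply the four cleaning steps to a list of dict rows."""
    cleaned = []
    for raw in raw_rows:
        # Translate column names from French to English.
        row = {COLUMN_MAP.get(key, key): value for key, value in raw.items()}
        # Drop rows whose year is missing or not a number.
        year = str(row.get("year", "")).strip()
        if not year.isdigit():
            continue
        row["year"] = int(year)
        # Drop unusual names: fewer than min_count babies in that year.
        row["count"] = int(row["count"])
        if row["count"] < min_count:
            continue
        # Add the department name when a lookup table is supplied.
        if dpt_names is not None and "dpt" in row:
            row["dpt_name"] = dpt_names.get(row["dpt"])
        cleaned.append(row)
    return cleaned

# Made-up sample rows in the assumed raw layout.
sample = [
    {"sexe": "1", "preusuel": "JEAN", "annais": "1900", "dpt": "75", "nombre": "100"},
    {"sexe": "2", "preusuel": "MARIE", "annais": "XXXX", "dpt": "75", "nombre": "50"},
    {"sexe": "2", "preusuel": "ZOE", "annais": "1901", "dpt": "13", "nombre": "5"},
]
cleaned = clean_rows(sample, dpt_names={"75": "Paris", "13": "Bouches-du-Rhône"})
print(cleaned)
```

For the national-level file, the same function can be called without `dpt_names`, in which case no `dpt_name` field is added.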
Plot example

French Baby Names
17.94MB