以下为卖家选择提供的数据验证报告:
数据描述
The notebook that generates this dataset is here: https://www.kaggle.com/johnjdavisiv/us-counties-weather-sociohealth-location-data
For an introduction to the data, check out this notebook: https://www.kaggle.com/johnjdavisiv/intro-to-the-us-counties-covid19-data
The 3,142 counties of the United States span a diverse range of social, economic, health, and weather conditions. Because of the COVID19 pandemic, over 2,400 of these counties have already experienced some COVID19 cases.
Combining county-level data on health, socioeconomics, and weather can help us address identify which populations are at risk for COVID19 and help prepare high-risk communities.
Temperature and humidity may affect the transmissibility of COVID19, but in the United States, warmer regions also tend to have markedly different socioeconomic and health demographics. As such, it's important to be able to control for factors like obesity, diabetes, access to healthcare, and poverty rates, since these factors themselves likely play a role in COVID19 transmission and fatality rates.
This dataset provides all of this information, formatted, cleaned, and ready for analysis. Most columns have little or no missing data. A small number have larger amounts of missing data; see the kernel that generated this dataset for details.
