Data description
Context
To produce cancer estimates in Brazil, the government, more specifically the National Cancer Institute (INCA), maintains systematic data collection centers. They are known as RCBP (Population-Based Cancer Registries). This data complies with regional laws and can be requested by anyone.
Here I translated the variable names to help with analysis, but most of the values are left untranslated out of laziness. However, almost every term can be translated with Google or belongs to an international coding system: CID-10 (ICD-10, the classification of diseases) or CID-O3 (ICD-O-3, the classification of cancers by topography and morphology). More about the terms can be found here (unfortunately this document is in Portuguese): www.inca.gov.br/publicacoes/manuais/manual-de-rotinas-e-procedimentos-para-registros-de-cancer-de-base-populacional
Moreover, I added estimated population data for almost all cities in Brazil. This data is produced by IBGE and was organized by Ricardo Dahis (email: rdahis@basedosdados.org | github_user: rdahis | website: www.ricardodahis.com | ckan_user: rdahis) and can be downloaded again here https://basedosdados.org/dataset/br-ibge-populacao
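As a rough illustration of how the population table could be combined with the registry records, here is a minimal pandas sketch that computes a crude rate per 100,000 inhabitants. The column names (`city_code`, `year`, `population`) are assumptions made for this example and are not the actual headers of the files; rates are only meaningful for cities and years actually covered by an RCBP.

```python
import pandas as pd

def crude_rate_per_100k(cases: pd.DataFrame, population: pd.DataFrame) -> pd.DataFrame:
    """Count cases per city and year, then divide by the estimated population."""
    # One row per registered case -> number of cases per (city, year).
    counts = (
        cases.groupby(["city_code", "year"])
        .size()
        .reset_index(name="n_cases")
    )
    # Attach the population estimate for the same city and year.
    merged = counts.merge(population, on=["city_code", "year"], how="left")
    merged["rate_per_100k"] = merged["n_cases"] / merged["population"] * 100_000
    return merged

# Toy data with hypothetical column names, only to show the shape of the inputs.
cases = pd.DataFrame({"city_code": [1, 1, 1, 2], "year": [2010, 2010, 2011, 2010]})
population = pd.DataFrame({
    "city_code": [1, 1, 2],
    "year": [2010, 2011, 2010],
    "population": [200_000, 100_000, 50_000],
})
rates = crude_rate_per_100k(cases, population)
print(rates)
```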
Content
This data is quite well organized; however, it has some flaws:
- RCBPs were added over time
- People are not always treated in their home state, so ratios can be affected by this
- There seems to be a lack of data from 2013-2019
Even so, this is the best dataset available on what is happening with cancer in Brazil!
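One way to check the apparent 2013-2019 gap is to count records per year and look for years with few or no rows. The sketch below assumes a year column named `diagnosis_year`, which is a hypothetical name; adjust it to whatever the file actually uses.

```python
import pandas as pd

def records_per_year(df: pd.DataFrame, year_col: str = "diagnosis_year",
                     start: int = 2000, end: int = 2019) -> pd.Series:
    """Number of records for every year in [start, end], with 0 for absent years."""
    counts = df[year_col].value_counts()
    # Reindex over the full range so that missing years show up explicitly as 0.
    return counts.reindex(range(start, end + 1), fill_value=0)

# Toy data; the column name is an assumption for this example.
toy = pd.DataFrame({"diagnosis_year": [2010, 2010, 2012, 2014]})
per_year = records_per_year(toy, start=2010, end=2014)
print(per_year)
```

Years with a count of zero, or a sharp drop compared with their neighbours, are the ones to treat with caution.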
Acknowledgements
This dataset was produced entirely by INCA; I only translated some terms and replaced strings that meant NA with NA.
What should you do with it?
There are some questions that I believe can be answered:
- Which cancers have the highest incidence in which populations/sub-populations?
- Which cancers have had their survival rate improved?
- Do people treat their cancers in their own state or do they go to other states? Are there any trends related to that?
- Do some centers treat their patients better than others? (Are there big differences in outcome depending on where the person was diagnosed?)
- How badly do people fill in these forms? (How much NA is there? How much is unspecific? Which variables are simply unusable?)
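For the last question, a simple starting point is the share of NA values per variable. This is only a sketch: the column names in the toy frame are hypothetical, and it counts only true NA values, not unspecific codes, which would need a separate look.

```python
import pandas as pd

def missing_share(df: pd.DataFrame) -> pd.Series:
    """Fraction of missing (NA) values per column, sorted from worst to best."""
    return df.isna().mean().sort_values(ascending=False)

# Toy rows with hypothetical column names; the real file uses its own headers.
toy = pd.DataFrame({
    "topography": ["C50", "C61", None, "C34"],
    "morphology": [None, None, None, "8140/3"],
    "diagnosis_year": [2005, 2006, 2007, 2008],
})
shares = missing_share(toy)
print(shares)
```

Columns near the top of the result with a share close to 1 are candidates for the "simply unusable" group.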
