以下为卖家选择提供的数据验证报告:
数据描述
Context
This is a modified version of Preet Viradiya's dataset "Brian Tumor Dataset", but all the tumor images have been preprocessed, normalized, and the tumor location metadata has been manually gathered into a separate dataset.
The full preprocess sequence is detailed in the first half of this notebook in the original dataset: Brain tumor image preprocessing & clasifier
DISCLAIMER: I am no neuroscientist, so this data should only be used for practice purposes, as some of the tumor data location is bound to be inaccurate or plainly wrong.
Content
The data is split in two datasets:
image_df
contains 2500 separate 128x128px images of cancer brain scans, one in each row. Reshaping a row into a 128x128 array should be necessary in order to display it correctly.data_df
contains four integers per entry detailing the coordinates of the top left corner of the approximate rectangle containing the tumor in the same-index image. The following two values contain the rectangle width and height respectively.
An example of loading and displaying data from this dataset has been included into the notebooks section under the name Dataset Usage Basic Example.
Acknowledgements
Thanks to Preet Viradiya for providing the original images
Inspiration
This dataset's goal is to find a way to improve an hypotetical brain scan classificator. The question of "does this brain have cancer?" has been answered in the original dataset, using regression and this modified dataset not only the classification question question can be answered but also a model can be trained to point out exactly where the tumor is located.
