老下头

verify-tagLizard dataset

biology

19

已售 0
750.11MB

数据标识:D17174979329724991

发布时间:2024/06/04

以下为卖家选择提供的数据验证报告:

数据描述

The development of deep segmentation models for computational pathology (CPath) can help foster the investigation of interpretable morphological biomarkers. Yet, there is a major bottleneck in the success of such approaches because supervised deep learning models require an abundance of accurately labelled data. This issue is exacerbated in the field of CPath because the generation of detailed annotations usually demands the input of a pathologist to be able to distinguish between different tissue constructs and nuclei. Manually labelling nuclei may not be a feasible approach for collecting large-scale annotated datasets, especially when a single image region can contain thousands of different cells. Yet, solely relying on automatic generation of annotations will limit the accuracy and reliability of ground truth. Therefore, to help overcome the above challenges, we propose a multi-stage annotation pipeline to enable the collection of large-scale datasets for histology image analysis, with pathologist-in-the-loop refinement steps. Using this pipeline, we generate the largest known nuclear instance segmentation and classification dataset, containing nearly half a million labelled nuclei in H&E stained colon tissue. We will publish the dataset and encourage the research community to utilise it to drive forward the development of downstream cell-based models in CPath.

Link to the dataset paper.

Citation

@inproceedings{graham2021lizard,   title={Lizard: A Large-Scale Dataset for Colonic Nuclear Instance Segmentation and Classification},   author={Graham, Simon and Jahanifar, Mostafa and Azam, Ayesha and Nimir, Mohammed and Tsang, Yee-Wah and Dodd, Katherine and Hero, Emily and Sahota, Harvir and Tank, Atisha and Benes, Ksenija and others},   booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},   pages={684--693},   year={2021} } 

Acknowledgements

We would like to acknowledge the following institutions, where the images in this dataset originated from:

  • University Hospitals Coventry and Warwickshire, United Kingdom
  • Histo Pathology Diagnostic Center, Shanghai, China
  • Ruijin Hospital, Shanghai, China
  • Xijing Hospital, Xi'an, China
  • Shanghai Songjiang District Central Hospital, Shanghai, China
  • The National Cancer Institute (NCI), United States of America
data icon
Lizard dataset
19
已售 0
750.11MB
申请报告