Satellite Image Caption Generation

Tags: agriculture, computer vision, image, text, image-to-text

Sold: 0
Size: 119.36 MB

Dataset ID: D17222506198435326

Published: 2024/07/29

The following is the data verification report that the seller chose to provide:

Data description

About the dataset

This dataset contains about 10.9K satellite images (10,921 in total) split into 3 folders:

  • 8734 train images
  • 1093 test images
  • 1094 validation images
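
As a quick check on the split sizes listed above, the proportions can be worked out directly. This is a minimal sketch that uses only the three counts given in the list; the variable names are illustrative and not part of the dataset.

```python
# Split sizes as listed in the dataset description.
split_counts = {"train": 8734, "test": 1093, "validation": 1094}

# Total number of images across all three folders.
total = sum(split_counts.values())

# Fraction of the whole dataset in each split, rounded to 3 decimals.
split_fractions = {
    name: round(count / total, 3) for name, count in split_counts.items()
}

print(total)
print(split_fractions)
```

The counts work out to roughly an 80/10/10 train/test/validation split.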

Each image is accompanied by 5 captions, each written by a different annotator.

For example:

Sample image 1 has 5 captions describing it:

  • "'Many planes are near a large building in an airport.'",
  • "'the terminal building with a large number of aircraft is surrounded by several tracks.'",
  • "'the terminal building wih a great many of airplanes aisde is surrounded by several runways .'",
  • "'Many planes are close to a large building at an airport.'",
  • "'many planes are near a large building in an airport .'"

Sample image 2

  • "'There is a large playground in the gym.'",
  • "'Many cars were parked outside the stadium.'",
  • "'many cars were parked outside the stadium .'",
  • '"There's a big playground in the gym."',
  • "'there is a big playground in the gym .'"
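
Several of the sample captions differ only in capitalization or in the spacing before the final period. Below is a minimal sketch of one way to normalize captions before training or deduplication. It assumes the captions have already been stripped of their surrounding quote characters; `normalize_caption` is a hypothetical helper, not something shipped with the dataset.

```python
import re


def normalize_caption(caption: str) -> str:
    """Lowercase a caption and tidy its whitespace and punctuation spacing."""
    text = caption.strip().lower()
    # Remove whitespace before punctuation: "airport ." -> "airport."
    text = re.sub(r"\s+([.,!?])", r"\1", text)
    # Collapse any remaining runs of whitespace into a single space.
    text = re.sub(r"\s+", " ", text)
    return text


# The five captions of sample image 1, copied verbatim (typos included).
captions = [
    "Many planes are near a large building in an airport.",
    "the terminal building with a large number of aircraft is surrounded by several tracks.",
    "the terminal building wih a great many of airplanes aisde is surrounded by several runways .",
    "Many planes are close to a large building at an airport.",
    "many planes are near a large building in an airport .",
]

# dict.fromkeys keeps the first occurrence of each caption and preserves order.
unique_captions = list(dict.fromkeys(normalize_caption(c) for c in captions))

print(len(unique_captions))
```

After normalization, the first and last captions of sample image 1 become identical, so the 5 captions reduce to 4 distinct ones. Whether to drop such near-duplicates depends on the task; for caption generation it may be reasonable to keep all 5 as separate references.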

Here's a starter kernel to view the images - https://www.kaggle.com/code/tomtillo/viewing-the-images-and-captions

See more details in these two papers: https://arxiv.org/pdf/1712.07835.pdf and https://www.sciencedirect.com/science/article/pii/S1877050920300752

Potential uses of this dataset

  1. Automatic caption generation

Train on the images and their corresponding captions using any of the CLIP models.

  2. Semantic search using embeddings
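
To illustrate the semantic search use case, here is a minimal sketch of ranking images by cosine similarity between a query embedding and image embeddings. The 3-dimensional vectors and file names are made up for illustration. In practice the embeddings would come from a model such as CLIP and would have hundreds of dimensions.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def search(query_embedding, image_embeddings, top_k=2):
    """Return the top_k (name, score) pairs, most similar first."""
    scored = [
        (name, cosine_similarity(query_embedding, embedding))
        for name, embedding in image_embeddings.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Hypothetical toy embeddings; real ones would be produced by an image encoder.
image_embeddings = {
    "airport.jpg": [0.9, 0.1, 0.0],
    "stadium.jpg": [0.1, 0.9, 0.1],
    "farmland.jpg": [0.0, 0.2, 0.9],
}

# Hypothetical embedding of a text query such as "planes at an airport".
query_embedding = [0.8, 0.2, 0.1]

results = search(query_embedding, image_embeddings)
for name, score in results:
    print(name, round(score, 3))
```

With these toy vectors the airport image ranks first and the stadium image second. A linear scan like this is fine at about 10K images; an approximate nearest-neighbor index only becomes worthwhile for much larger collections.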