小KE YA🌸

verify-tagAmazon ML Challenge: 2023

data cleaningtextbigqueryregression

18

已售 0
860.69MB

数据标识:D17193990011583320

发布时间:2024/06/26

以下为卖家选择提供的数据验证报告:

数据描述

In this hackathon, the goal was to develop a machine learning model that can predict the length dimension of a product. Product length is crucial for packaging and storing products efficiently in the warehouse. Moreover, in many cases, it is an important attribute that customers use to assess the product size before purchasing. However, measuring the length of a product manually can be time-consuming and error-prone, especially for large catalogs with millions of products.

You will have access to the product title, description, bullet points, product type ID, and product length for 2.2 million products to train and test your submissions. Note that there is some noise in the data.

Task

You are required to build a machine learning model that can predict product length from catalog metadata.

Dataset description

The dataset folder contains the following files:

train.csv: 2249698 x 6 test.csv: 734736 x 5 sample_submission.csv: 734736 x 2

The dataset belongs to: Amazon India.

data icon
Amazon ML Challenge: 2023
18
已售 0
860.69MB
申请报告