麻酱

verify-tag190K+ Spam | Ham Email Dataset for Classification

beginnernlpclassificationbinary classificationtext classification

27

已售 0
341.11MB

数据标识:D17171246806270046

发布时间:2024/05/31

以下为卖家选择提供的数据验证报告:

数据描述

Spam or Ham Email Classification Dataset

Overview

This dataset contains over 190,000+ emails labeled as either spam or ham (non-spam). Each email is represented by its text content along with its corresponding label.

Description

The dataset provides a comprehensive collection of emails, categorized as either spam or ham, intended to facilitate research and development in email classification algorithms. With a vast corpus of emails, this dataset offers ample opportunities for training and evaluating machine learning models for effective spam detection.

Features

  • Text: The content of the email.
  • Label: The classification label indicating whether the email is spam (1) or ham (0).

Usage

Researchers and practitioners can leverage this dataset to:

  • Develop and evaluate machine learning models for email classification.
  • Explore natural language processing techniques for spam detection.
  • Conduct comparative studies on the effectiveness of different classification algorithms.
  • Investigate emerging trends and patterns in email spamming behavior.
data icon
190K+ Spam | Ham Email Dataset for Classification
27
已售 0
341.11MB
申请报告