## 引言与背景
情感是人类认知和行为的核心驱动力,理解情感的表达、影响及其在不同领域的作用机制,对于心理学研究、人机交互、内容分析等多个领域具有重要意义。随着自然语言处理技术的发展,基于大规模标注数据的情感分析成为研究热点,但现有的情感数据集往往局限于基本情感类型,难以满足复杂情感研究的需求。本数据集旨在填补这一空白,提供覆盖75种细粒度情感类型的大规模文本数据,为情感计算、心理学研究、智能对话系统等领域提供丰富的研究资源。
本数据集包含完整的三元组数据结构,每条记录由提示问题(prompt)、情感标签(emotion)和详细回复(response)三部分构成。提示问题围绕特定情感在不同领域的表现展开,涉及领导力、历史叙事、多样性管理、伦理决策、医疗健康、文化社会、团队协作、家庭关系、客户服务、技术创新等多个应用场景。回复内容则深入分析该情感的正反两方面影响,并结合具体案例进行阐述。这种结构化的数据形式不仅便于直接用于模型训练,也为定性分析提供了丰富的素材。
该数据集的核心价值在于其覆盖范围的广度和内容的深度。相较于传统情感数据集仅包含积极、消极、中性等基础分类,本数据集细化到75种具体情感类型,包括羡慕(Envy)、冷漠(Apathy)、忧郁(Melancholy)、好奇(Curiosity)、感激(Gratitude)等,能够支持更精细的情感识别和分析任务。同时,每条记录的回复内容均经过精心撰写,包含多维度的分析视角,为研究者提供了深入理解情感机制的宝贵资源。
## 数据基本信息
### 数据字段说明
| 字段名称 | 字段类型 | 字段含义 | 数据示例 | 完整性 |
|---------|---------|---------|---------|--------|
| prompt | string | 情感相关的提示问题,引导对特定情感在某领域作用的分析 | Examine how Envy plays a role in leadership, management, and organizational behavior. | 100% |
| emotion | string | 情感类型标签,共75种细粒度情感分类 | Envy | 100% |
| response | string | 针对提示问题的详细回复,包含情感影响的正反分析及案例 | Envy is a complex emotion that can have both positive and negative effects on leadership... | 100% |
### 数据分布情况
#### 情感类型分布
本数据集共涵盖75种情感类型,各类型记录数分布相对均衡,最低3,723条,最高4,023条。以下为部分主要情感类型的分布情况:
| 情感类型 | 记录数量 | 占比 |
|---------|---------|------|
| Euphoria | 4,023 | 1.40% |
| Shame | 3,987 | 1.39% |
| Intrigue | 3,979 | 1.38% |
| Playfulness | 3,967 | 1.38% |
| Embarrassment | 3,966 | 1.38% |
| Sympathy | 3,960 | 1.38% |
| Delight | 3,929 | 1.37% |
| Generosity | 3,926 | 1.37% |
| Pride | 3,916 | 1.36% |
| Wonder | 3,908 | 1.36% |
| Disappointment | 3,904 | 1.36% |
| Optimism | 3,901 | 1.36% |
| Elation | 3,900 | 1.36% |
| Sadness | 3,898 | 1.36% |
| Frustration | 3,892 | 1.35% |
| Love | 3,884 | 1.35% |
| Anger | 3,881 | 1.35% |
| Inspiration | 3,877 | 1.35% |
| Empathy | 3,874 | 1.35% |
| Tranquility | 3,868 | 1.35% |
#### 情感类型分类统计
根据情感的性质和特点,可将75种情感类型分为以下几大类:
| 情感类别 | 包含情感举例 | 记录数量 | 占比 |
|---------|-------------|---------|------|
| 积极情感 | Joy, Hope, Gratitude, Pride, Delight | ~56,000 | ~19.5% |
| 消极情感 | Anger, Fear, Sadness, Shame, Guilt | ~57,000 | ~19.8% |
| 复杂情感 | Envy, Jealousy, Sympathy, Empathy | ~58,000 | ~20.2% |
| 认知情感 | Curiosity, Intrigue, Confusion, Wonder | ~58,000 | ~20.2% |
| 社交情感 | Love, Affection, Generosity, Kindness | ~58,000 | ~20.3% |
### 数据规模与特征
- 总记录数:287,280条
- 情感类型数:75种
- 平均回复长度:约2,500-3,000字符
- 数据格式:JSON Lines格式(每行一个JSON对象)
- 文件大小:约716MB
- 语言:英文
## 数据优势
| 优势特征 | 具体表现 | 应用价值 |
|---------|---------|---------|
| 覆盖范围广 | 包含75种细粒度情感类型,远超传统数据集的基础分类 | 支持细粒度情感识别、情感分类、情感演化等复杂任务 |
| 数据量大 | 287,280条高质量标注记录,规模庞大 | 满足深度学习模型训练需求,提升模型泛化能力 |
| 内容深度高 | 每条回复包含情感影响的正反分析及具体案例 | 支持情感理解、情感推理、情感生成等高级任务 |
| 领域多样性 | 提示问题涉及领导力、医疗、教育、技术等多个领域 | 支持跨领域情感分析、领域自适应模型训练 |
| 结构规整 | 统一的三元组结构(prompt-emotion-response) | 便于数据预处理、特征提取和模型输入构建 |
| 标注质量高 | 情感标签准确,回复内容逻辑清晰、分析深入 | 保证模型训练的可靠性和结果的可信度 |
| 平衡分布 | 各情感类型记录数均衡(3,700-4,000条) | 避免模型偏向高频情感,提升分类公平性 |
## 数据样例
以下为数据集的元数据样例,涵盖多种情感类型和应用场景:
样例1:Envy(羡慕) - Prompt: Examine how Envy plays a role in leadership, management, and organizational behavior. - Response: Envy is a complex emotion that can have both positive and negative effects on leadership, management, and organizational behavior. While envy can sometimes lead to negative outcomes such as conflict, resentment, and sabotage, it can also foster healthy competition, motivation, and learning...样例2:Apathy(冷漠) - Prompt: Assess the role of Apathy in shaping historical narratives and national identity. - Response: Apathy, defined as a lack of interest, enthusiasm, or concern, can have both positive and negative effects on shaping historical narratives and national identity...样例3:Curiosity(好奇) - Prompt: Evaluate the effect of Curiosity on navigating ethical dilemmas and moral decision-making. - Response: Curiosity is a natural human trait that drives individuals to seek knowledge and understanding of the world around them. It can have a significant impact on navigating ethical dilemmas...样例4:Gratitude(感激) - Prompt: Evaluate the impact of Gratitude on health care decisions and patient care. - Response: Studies have shown that practicing gratitude can have a significant impact on healthcare decisions and patient care. Patients who express gratitude towards their healthcare providers have been found to have better health outcomes...样例5:Kindness(善良) - Prompt: Study the impact of Kindness on handling customer complaints and dispute resolution. - Response: Kindness, empathy, and understanding are essential qualities for handling customer complaints and dispute resolution. When customers experience issues with a product or service, kind and empathetic responses can go a long way in resolving the issue...样例6:Euphoria(愉悦) - Prompt: Consider the impact of Euphoria on the development of virtual reality experiences. - Response: Euphoria, a physics engine technology developed by NaturalMotion, has had a significant impact on the development of virtual reality experiences...样例7:Jealousy(嫉妒) - Prompt: Examine how Jealousy affects motivation and drive in achieving personal and team goals. - Response: Jealousy, defined as the feeling of resentment and insecurity arising from the perceived superiority or advantage of others, can have both positive and negative effects on motivation...样例8:Delight(喜悦) - Prompt: Explore the impact of Delight on familial relationships and parenting styles. - Response: Delight is a concept that refers to the positive emotions and experiences that individuals have in their relationships, particularly in close relationships such as those between family members...样例9:Inspiration(灵感) - Prompt: Assess how Inspiration shapes the field of synthetic biology and biotech innovations. - Response: Inspiration plays a crucial role in shaping the field of synthetic biology and biotech innovations. New ideas often lead to breakthrough concepts that drive innovation...样例10:Anticipation(期待) - Prompt: Evaluate the effect of Anticipation on the progression of smart city infrastructure and urban resilience. - Response: Anticipation, the ability to predict and prepare for future events, has a significant effect on the progression of smart city infrastructure and urban resilience...样例11:Admiration(钦佩) - Prompt: Analyze the impact of Admiration on personal resilience and adaptation to challenges. - Response: Admiration, defined as the feeling of deep respect and admiration for someone or something, can have a significant impact on personal resilience and adaptation to challenges...样例12:Sympathy(同情) - Prompt: Assess how Sympathy shapes public attitudes towards genetic modification and cloning. - Response: Sympathy is a crucial factor in shaping public attitudes towards genetic modification and cloning. Empathy for human and animal welfare plays a significant role in public opinion...样例13:Introspection(内省) - Prompt: Explore how Introspection interacts with cultural, social, and individual factors in shaping behavior. - Response: Introspection, the process of examining one's thoughts, feelings, and motivations, is a complex phenomenon that interacts with cultural, social, and individual factors...样例14:Surprise(惊讶) - Prompt: Consider the impact of Surprise on the development of virtual reality experiences. - Response: Surprise is a powerful emotion that can greatly impact the development of virtual reality experiences. Unexpected events and sensory inputs can enhance immersion...样例15:Amusement(娱乐) - Prompt: Examine how Amusement affects the ability to inspire and motivate others. - Response: Amusement, as a form of entertainment, has the potential to inspire and motivate others through creativity, positivity, and empathy...样例16:Contentment(满足) - Prompt: Evaluate the effect of Contentment on career development and work-life balance. - Response: Contentment, a state of happiness and satisfaction, plays a significant role in career development and work-life balance. Individuals who feel content are more likely to maintain stable performance...样例17:Fear(恐惧) - Prompt: Assess how Fear influences decision-making in high-stakes business environments. - Response: Fear is a powerful emotion that significantly influences decision-making in high-stakes business environments. While fear can lead to risk aversion and conservative choices, it can also motivate careful analysis...样例18:Hope(希望) - Prompt: Explore the impact of Hope on recovery from physical and mental health challenges. - Response: Hope is a vital factor in recovery from physical and mental health challenges. Studies show that individuals with higher levels of hope tend to have better treatment outcomes...样例19:Loneliness(孤独) - Prompt: Study the effect of Loneliness on creative thinking and artistic expression. - Response: Loneliness can have complex effects on creative thinking and artistic expression. While it may lead to introspection and deep reflection, it can also hinder collaborative creativity...样例20:Determination(决心) - Prompt: Analyze how Determination contributes to entrepreneurial success and innovation. - Response: Determination is a key driver of entrepreneurial success and innovation. Entrepreneurs who demonstrate strong determination are more likely to overcome obstacles and persist through challenges...## 应用场景
### 情感分析模型训练
本数据集最直接的应用场景是情感分析模型的训练和优化。传统情感分析往往局限于积极、消极、中性等基础分类,而本数据集提供的75种细粒度情感类型,能够支持更精细的情感识别任务。研究者可以利用该数据集训练多标签情感分类模型,实现对文本中多种情感的同时识别。例如,在社交媒体分析中,模型可以同时识别用户表达的羡慕、嫉妒、同情等多种复杂情感,为舆情监测和用户画像提供更丰富的维度。此外,数据集的三元组结构(prompt-emotion-response)也适用于情感生成任务,训练模型根据给定的情感和场景生成连贯的文本回复,为智能对话系统提供支持。
### 心理学研究与情感机制分析
该数据集为心理学研究提供了丰富的文本语料,可用于分析不同情感在各种场景下的表现形式和影响机制。研究者可以通过文本挖掘技术,提取情感表达的语言特征,分析不同情感的语义关联和语境依赖。例如,通过对比Envy和Jealousy在领导场景中的表达差异,可以深入理解这两种相似情感的细微区别。此外,数据集中包含的正反分析结构,为研究情感的双重性提供了天然素材,有助于揭示情感在不同情境下的适应性功能。这种基于大规模文本数据的分析方法,能够弥补传统实验心理学样本量有限的不足,为情感科学研究提供新的视角和方法。
### 人机交互与用户体验优化
在人机交互领域,理解用户情感是提升系统智能性和亲和力的关键。本数据集可以用于训练情感感知模型,使智能系统能够识别用户输入中的情感状态,并做出相应的情感响应。例如,在智能客服系统中,当检测到用户表达Frustration(沮丧)或Anger(愤怒)时,系统可以调整回复策略,采用更耐心、安抚性的语言;当检测到用户表达Gratitude(感激)时,系统可以强化积极反馈,提升用户满意度。此外,数据集中丰富的情感表达范例,也可以作为情感对话生成的参考,帮助构建更自然、更具情感共鸣的人机交互体验。
### 内容分析与舆情监测
在内容分析和舆情监测领域,该数据集可以支持对大规模文本数据的深度情感挖掘。例如,在新闻媒体分析中,可以识别不同报道中蕴含的情感倾向,分析媒体如何通过情感表达影响公众舆论;在产品评论分析中,可以细粒度地识别用户对产品不同方面的情感反应,为产品改进提供精准反馈。此外,数据集覆盖的多领域特征(领导力、医疗、技术、教育等),使得模型能够适应不同领域的情感表达特点,实现跨领域的情感分析应用。
### 教育与培训应用
本数据集还可以应用于教育和培训领域,特别是在情商培养和沟通技巧训练方面。数据集中包含的情感分析案例,可以作为教学素材,帮助学习者理解不同情感在各种情境下的表现和影响。例如,通过学习Kindness(善良)在客户服务中的应用案例,学习者可以提升服务意识和沟通技巧;通过分析Empathy(同理心)在团队管理中的作用,管理者可以更好地理解团队成员的情感需求,提升团队凝聚力。此外,数据集还可以用于构建情感模拟训练系统,为学习者提供虚拟的情感互动场景,增强情感识别和应对能力。
## 结尾
本数据集以其大规模、细粒度、多领域的特点,为情感分析、心理学研究、人机交互等领域提供了宝贵的研究资源。287,280条高质量标注记录覆盖75种情感类型,不仅数量庞大,更在内容深度和标注质量上具有显著优势。每条记录包含精心设计的提示问题和深入分析的回复内容,为模型训练和定性研究提供了丰富的素材。
该数据集的核心价值在于其细粒度的情感分类和多样化的应用场景覆盖。相较于传统情感数据集,它能够支持更复杂的情感分析任务,推动情感计算领域向更精细、更深入的方向发展。同时,数据集中蕴含的丰富情感表达范例和分析视角,也为心理学研究提供了新的方法论和研究思路。
数据集采用JSON Lines格式存储,便于程序处理和大规模数据分析。研究人员可以直接使用该数据集进行模型训练、特征提取和情感挖掘,也可以根据具体需求进行数据筛选和转换。无论是学术研究还是工业应用,该数据集都具有广泛的应用前景和重要的参考价值。
如需获取更多关于数据集的详细信息或使用指导,可私信获取进一步支持。
看了又看
验证报告
以下为卖家选择提供的数据验证报告:






