The following is the data validation report that the seller has chosen to provide:
Data description
Context
The Myers-Briggs Type Indicator (MBTI) is one of the most popular personality models. It applies a binary categorization along four dimensions and yields 16 possible personality types, one for each combination of the four values.
• Introversion/Extraversion: The first dimension concerns where a person directs their energy. Extraverts prefer to put their energy into dealing with people, circumstances, or the outer world. Introverts prefer to focus their energy on ideas, facts, explanations, or beliefs, that is, the inner world.
• iNtuition/Sensing: The second dimension concerns how information is processed. Sensing people prefer to look at facts and work with what is known. iNtuition people tend to experiment with ideas and investigate the unknown.
• Feeling/Thinking: The third dimension concerns decision making. Thinking people make decisions based on objective reasoning and an independent perspective. Feeling people prefer to base their decisions on values.
• Perception/Judgement: The final dimension concerns a person's chosen way of life. Judgement people prefer to have their lives planned and structured. Perception people prefer to go with the flow, stay adaptable, and respond to events as they occur.
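The four dimensions above can be expressed as a small lookup that expands a four-letter code into its preferences. This is only an illustrative sketch: the names AXES, decode_type, and ALL_TYPES are hypothetical, and the letter order (I/E, N/S, F/T, P/J) is assumed to follow the order of the list above.

```python
from itertools import product

# One dict per dimension, in the order the letters appear in a type code.
# Names and layout are illustrative assumptions, not part of the dataset.
AXES = [
    {"I": "Introversion", "E": "Extraversion"},
    {"N": "iNtuition", "S": "Sensing"},
    {"F": "Feeling", "T": "Thinking"},
    {"P": "Perception", "J": "Judgement"},
]


def decode_type(code):
    """Expand a four-letter MBTI code into its four preference names."""
    code = code.strip().upper()
    if len(code) != 4:
        raise ValueError(f"expected a 4-letter code, got {code!r}")
    try:
        return [axis[letter] for axis, letter in zip(AXES, code)]
    except KeyError:
        raise ValueError(f"invalid MBTI code: {code!r}") from None


# Every combination of one letter per dimension: 2 * 2 * 2 * 2 = 16 types.
ALL_TYPES = ["".join(letters) for letters in product(*AXES)]
```

For example, decode_type("infp") expands the code into Introversion, iNtuition, Feeling, and Perception, and ALL_TYPES enumerates the 16 combinations.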
Content
This dataset contains over 7800 rows of data. Each row holds one user's Type (the person's four-letter MBTI code) together with that user's text entries, with each entry separated by "|||" (three pipe characters).
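A minimal sketch of splitting one row's text field on the "|||" separator is shown below. The function name split_entries and the sample string are hypothetical; the dataset's actual column names are not stated here, so none are assumed.

```python
SEPARATOR = "|||"  # three pipe characters, as described above


def split_entries(text):
    """Split a row's text field into individual entries.

    Surrounding whitespace is trimmed and empty entries are dropped.
    """
    return [part.strip() for part in text.split(SEPARATOR) if part.strip()]


# Made-up sample text, not taken from the dataset.
sample = "first entry|||second entry||| third entry "
entries = split_entries(sample)
```

Dropping empty pieces is a design choice for this sketch; keep them instead if the position of each entry matters for your analysis.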
Acknowledgements
The dataset was obtained from Twitter using TwitterAPI. To label users, search phrases such as "I am an...", "My MBTI is...", and "My personality type is..." were used. Data was then collected for every personality type through TwitterAPI queries. Users were selected based on their self-reported personality types, and their timelines were used to collect tweets. To ensure data quality, only users who shared more than 200 words were included in the dataset. Precision was prioritized throughout the data collection process. Despite this emphasis, the dataset may contain biases arising from the search phrases used and from the self-reported nature of the personality types. Ethical concerns surrounding the collection and use of data from social media platforms such as Twitter should also be taken into consideration.
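The labeling and filtering steps described above might look roughly like the following sketch. It is not the collection code used for this dataset: the regular expression, the function names, and the choice to count words across all of a user's tweets are assumptions made for illustration, and no Twitter API calls are shown.

```python
import re

# Hypothetical pattern built from the three search phrases quoted above,
# followed by a four-letter type code. Case-insensitive by assumption.
TYPE_PATTERN = re.compile(
    r"\b(?:I am an?|My MBTI is|My personality type is)\s+([IE][NS][FT][PJ])\b",
    re.IGNORECASE,
)


def self_reported_type(tweet):
    """Return the self-reported MBTI code found in a tweet, or None."""
    match = TYPE_PATTERN.search(tweet)
    return match.group(1).upper() if match else None


def has_enough_text(tweets, min_words=200):
    """Keep a user only if their tweets total more than min_words words.

    Counting whitespace-separated words across all tweets is an assumption.
    """
    total_words = sum(len(tweet.split()) for tweet in tweets)
    return total_words > min_words
```

Requiring an explicit phrase plus a valid four-letter code favors precision over recall, in line with the priority stated above, at the cost of missing users who describe their type in other words.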
