The following is the data verification report that the seller chose to provide:
Data description
This dataset contains 5 files with gardening and landscaping information from the Gardening & Landscaping Stack Exchange community.
- QueryResults.csv - list of questions and accepted answers from the community up to 31st December 2023
- question_embeddings.pickle - a binary file format that contains text embeddings generated using the textembedding@gecko003 model on Vertex AI for all questions (a combination of text in question Title and Body)
- question_text_only_embeddings.pickle - a binary file format that contains text embeddings generated using the textembedding@gecko003 model on Vertex AI for all questions (a combination of text in question Title and Body) after removing HTML and newline characters (checking for some missing values)
- TagCounts.csv - list of tags used for labeling the questions by community members (each question can be tagged with multiple tags)
- tag_embeddings.pickle - a binary file format that contains text embeddings generated using the textembedding@gecko003 model on Vertex AI for all tags
The data was downloaded from the Stack Exchange Data Explorer. All Stack Exchange user contributions are licensed under CC-BY-SA 3.0 with attribution required.
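
As a rough starting point for working with the three .pickle files listed above, the sketch below loads a pickled object and reports its type and size. The description does not say how the embeddings are stored inside each file (dict, list, DataFrame, or something else), so the helpers `load_pickle` and `summarize` are hypothetical and deliberately assume nothing about the layout. If the files hold pandas or NumPy objects, those libraries would need to be installed before unpickling.

```python
import pickle
from pathlib import Path


def load_pickle(path):
    """Load one pickled object. Only unpickle files from a source you trust."""
    with Path(path).open("rb") as fh:
        return pickle.load(fh)


def summarize(obj):
    """Describe a loaded object without assuming its internal layout."""
    size = len(obj) if hasattr(obj, "__len__") else None
    return {"type": type(obj).__name__, "size": size}


if __name__ == "__main__":
    # File names come from the dataset description; their contents are unknown here.
    names = (
        "question_embeddings.pickle",
        "question_text_only_embeddings.pickle",
        "tag_embeddings.pickle",
    )
    for name in names:
        if Path(name).exists():
            print(name, summarize(load_pickle(name)))
        else:
            print(name, "not found in the current directory")
```

Inspecting the type first makes it easier to decide how to join the embeddings with the rows of QueryResults.csv and TagCounts.csv.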

Gardening Q&A with text embeddings
57.53MB