Chinese Couplets/chinese Zodiac Signs/city Attractions/travel Plans... A Collection of Datasets With a Strong Chinese New Year Flavor

On the third day of the first lunar month, the New Year atmosphere is in full swing. The festive atmosphere of the New Year is not only permeating the streets and alleys, but also triggering many innovative applications in the field of artificial intelligence. From AI-generated Spring Festival couplets, to intelligent recognition of Spring Festival traffic signs, to in-depth mining of zodiac culture, AI tools are empowering traditional folk customs and making the New Year more festive!
In this festive and peaceful day of family reunion, HyperAI has compiled 8 Spring Festival-related data sets for everyone, covering popular topics such as couplets, the twelve zodiac signs, and ethnic culture, to help developers make great strides in AI-enabled Spring Festival! Come and get your "Spring Festival Gift Pack"~
Click to view more open source datasets:
Scan the QR code and remark "dataset" to join the discussion group↓

Summary of Spring Festival Dataset
1. Chinese Couplet Dataset
Estimated size:25.62 MB
Download address:https://go.hyper.ai/ULbsO
This dataset contains about 740k couplets. fixed_couplets_in.txt is the upper couplet and fixed_couplets_out.txt is the lower couplet.
2. Traffic Sign Image Dataset
Estimated size:184.71 MB
Download address:https://go.hyper.ai/UgwHq
The dataset contains 5,998 traffic sign images from 58 categories. Each image is a zoomed view of a single traffic sign. The annotations provide image properties (filename, width, height) as well as traffic sign coordinates within the image and category (e.g. 5 km/h speed limit).
3. Street view traffic sign dataset
Estimated size:15.96 GB
Download address:https://go.hyper.ai/bST8U
The dataset consists of 9,898 street view images. Each photo contains at least one or more traffic signs, and the traffic sign coordinates and categories are annotated. The data comes from the China Traffic Sign Detection Database.
4. Chinese Zodiac Dataset
Estimated size:2.07 GB
Download address:https://go.hyper.ai/zfL09
The dataset contains 8,508 images of the twelve Chinese zodiac signs. The dataset has been pre-split into training, validation, and testing with a ratio of 85:7.5:7.5.
5. Multi-needle embroidery dataset
Estimated size:1000.37 MB
Download address:https://go.hyper.ai/Ywfip
This dataset is the first embroidery dataset with detailed annotations of single stitch and multi-stitch labels. More than 30K images, including embroidery images and corresponding content images, were produced by professional embroidery software (Wilcom 9.0).
6. National Culture Fine-tuning Dataset
Estimated size:1.43 MB
Download address:https://go.hyper.ai/SSpo4
The dataset contains 151,159 data items, of which 10,000 are publicly available, covering seven major areas: architecture, clothing, crafts, food, etiquette, language, and customs.
7. Travel planning dataset
Estimated size:32.9 MB
Download address:https://go.hyper.ai/5haVS
The dataset contains 1,225 carefully curated planning intentions and reference plans, designed to test the planning capabilities of language agents in real-world scenarios. The dataset is based on travel planning and requires language agents to generate a comprehensive travel plan based on a given query, including transportation, daily meals, attractions, and accommodation.
8. City attractions dataset
Estimated size:22.34 MB
Download address:https://go.hyper.ai/RR9DP
This dataset contains tourist attraction data from 352 cities in China. Each city csv file contains 100 locations. The data includes the location name, website, address, attraction introduction, opening hours, image website, rating, recommended visit duration, recommended visit season, ticket information, tips, etc.
The above is the Spring Festival-related data set compiled by HyperAI. If you have resources that you want to include on the hyper.ai official website, you are welcome to leave a message or submit a contribution to tell us!
About HyperAI
HyperAI (hyper.ai) is the leading artificial intelligence and high-performance computing community in China.We are committed to becoming the infrastructure in the field of data science in China and providing rich and high-quality public resources for domestic developers. So far, we have:
* Provide domestic accelerated download nodes for 1300+ public data sets
* Includes 400+ classic and popular online tutorials
* Interpretation of 200+ AI4Science paper cases
* Support 500+ related terms search
* Hosting the first complete Apache TVM Chinese documentation in China
Visit the official website to start your learning journey:
