FoodSky: A Food-oriented Large Language Model, and FoodEarth: A Foundamental Food Corpus and Instruction Dataset
Authors/Creators
Contributors
Project member:
Description
Food is the cornerstone of both survival and social life. With the increasing complexity of global dietary needs and preferences, there is a growing demand for food intelligence to enable tasks like recipe recommendation and diet-disease correlation discovery. To address this, we introduce the Food-oriented Large Language Model (LLM) FoodSky, which offers fine-grained perception and reasoning of food data. We constructed a food corpus, FoodEarth, from various authoritative sources to enhance FoodSky's knowledge. We also developed the Topic-based Selective State Space Model and Hierarchical Topic Retrieval Augmented Generation algorithms to improve FoodSky's ability to capture fine-grained food semantics and generate context-aware food-relevant text. Extensive experiments show that FoodSky outperforms general-purpose LLMs on the Chinese National Chef Exam and Dietetic Exam, achieving accuracies of 67.2% and 66.4%, respectively. FoodSky not only enhances culinary creativity and promotes healthier eating patterns but also establishes a new standard for domain-specific LLMs tackling real-world food-related issues.
Files
Additional details
Software
- Repository URL
- https://github.com/LanceZPF/FoodSky
- Programming language
- Python
References
- @article{zhou2024foodsky, title={FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination}, author={Zhou, Pengfei and Min, Weiqing and Fu, Chaoran and Jin, Ying and Huang, Mingyu and Li, Xiangyang and Mei, Shuhuan and Jiang, Shuqiang}, journal={arXiv preprint arXiv:2406.10261}, year={2024} }