There is a newer version of the record available.

Published September 25, 2024 | Version v2
Dataset Restricted

FoodSky: A Food-oriented Large Language Model, and FoodEarth: A Foundamental Food Corpus and Instruction Dataset

  • 1. ROR icon Institute of Computing Technology
  • 2. ROR icon Chinese Academy of Sciences

Contributors

Project member:

Description

Food is the cornerstone of both survival and social life. With the increasing complexity of global dietary needs and preferences, there is a growing demand for food intelligence to enable tasks like recipe recommendation and diet-disease correlation discovery. To address this, we introduce the Food-oriented Large Language Model (LLM) FoodSky, which offers fine-grained perception and reasoning of food data. We constructed a food corpus, FoodEarth, from various authoritative sources to enhance FoodSky's knowledge. We also developed the Topic-based Selective State Space Model and Hierarchical Topic Retrieval Augmented Generation algorithms to improve FoodSky's ability to capture fine-grained food semantics and generate context-aware food-relevant text. Extensive experiments show that FoodSky outperforms general-purpose LLMs on the Chinese National Chef Exam and Dietetic Exam, achieving accuracies of 67.2% and 66.4%, respectively. FoodSky not only enhances culinary creativity and promotes healthier eating patterns but also establishes a new standard for domain-specific LLMs tackling real-world food-related issues.

Files

Restricted

The record is publicly accessible, but files are restricted. <a href="https://zenodo.org/account/settings/login?next=https://zenodo.org/records/13836709">Log in</a> to check if you have access.

Additional details

Software

Repository URL
https://github.com/LanceZPF/FoodSky
Programming language
Python

References

  • @article{zhou2024foodsky, title={FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination}, author={Zhou, Pengfei and Min, Weiqing and Fu, Chaoran and Jin, Ying and Huang, Mingyu and Li, Xiangyang and Mei, Shuhuan and Jiang, Shuqiang}, journal={arXiv preprint arXiv:2406.10261}, year={2024} }