TY - GEN
T1 - Building Multi-Agent Copilot towards Autonomous Agricultural Data Management and Analysis
AU - Pan, Yu
AU - Sun, Jianxin
AU - Yu, Hongfeng
AU - Luck, Joe
AU - Bai, Geng
AU - Chamara, Nipuna
AU - Ge, Yufeng
AU - Awada, Tala
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - The ubiquity of sensors and IoT devices has led to an explosion in data availability in modern agriculture. The large volume and heterogeneity of the data, together with the complexity of data processing requirements, pose huge obstacles for achieving the principles of Findable, Accessible, Interoperable, and Reusable (FAIR). Current data management and analysis paradigms are to a large extent traditional, in which data collecting, curating, integration, loading, storing, sharing and analyzing still involve too much human effort and know-how. The experts, researchers and the farm operators need to understand the data and the whole process of data management pipeline to make full use of the data. The essential problem of the traditional paradigm is the lack of a layer of orchestrational intelligence which can understand, organize and coordinate the data processing utilities to maximize data management and analysis outcome. The emerging reasoning and tool mastering abilities of large language models (LLM) make it a potentially good fit to this position, which helps a shift from the traditional user-driven paradigm to AI-driven paradigm. In this paper, we propose and explore the idea of a LLM based copilot for autonomous agricultural data management and analysis. Based on our previously developed platform of Agricultural Data Management and Analytics (ADMA), we build a proof-of-concept multi-agent system called ADMA Copilot, which can understand user's intent, makes plans for data processing pipeline and accomplishes tasks automatically, in which three agents: a LLM based controller, an input formatter and an output formatter collaborate together. Different from existing LLM based solutions, by defining a meta-program graph, our work decouples control flow and data flow to enhance the predictability of the behavior of the agents. Experiments demonstrate the intelligence, autonomy, efficacy, efficiency, extensibility, flexibility and privacy of our system. Comparison is also made between ours and existing systems to show the superiority and potential of our system.
AB - The ubiquity of sensors and IoT devices has led to an explosion in data availability in modern agriculture. The large volume and heterogeneity of the data, together with the complexity of data processing requirements, pose huge obstacles for achieving the principles of Findable, Accessible, Interoperable, and Reusable (FAIR). Current data management and analysis paradigms are to a large extent traditional, in which data collecting, curating, integration, loading, storing, sharing and analyzing still involve too much human effort and know-how. The experts, researchers and the farm operators need to understand the data and the whole process of data management pipeline to make full use of the data. The essential problem of the traditional paradigm is the lack of a layer of orchestrational intelligence which can understand, organize and coordinate the data processing utilities to maximize data management and analysis outcome. The emerging reasoning and tool mastering abilities of large language models (LLM) make it a potentially good fit to this position, which helps a shift from the traditional user-driven paradigm to AI-driven paradigm. In this paper, we propose and explore the idea of a LLM based copilot for autonomous agricultural data management and analysis. Based on our previously developed platform of Agricultural Data Management and Analytics (ADMA), we build a proof-of-concept multi-agent system called ADMA Copilot, which can understand user's intent, makes plans for data processing pipeline and accomplishes tasks automatically, in which three agents: a LLM based controller, an input formatter and an output formatter collaborate together. Different from existing LLM based solutions, by defining a meta-program graph, our work decouples control flow and data flow to enhance the predictability of the behavior of the agents. Experiments demonstrate the intelligence, autonomy, efficacy, efficiency, extensibility, flexibility and privacy of our system. Comparison is also made between ours and existing systems to show the superiority and potential of our system.
KW - Agricultural Data Management
KW - Autonomous
KW - Copilot
KW - FAIR principles
KW - Large Language Model
KW - Multi-Agent
KW - Paradigm shift
UR - http://www.scopus.com/inward/record.url?scp=85218027835&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85218027835&partnerID=8YFLogxK
U2 - 10.1109/BigData62323.2024.10826038
DO - 10.1109/BigData62323.2024.10826038
M3 - Conference contribution
AN - SCOPUS:85218027835
T3 - Proceedings - 2024 IEEE International Conference on Big Data, BigData 2024
SP - 4384
EP - 4393
BT - Proceedings - 2024 IEEE International Conference on Big Data, BigData 2024
A2 - Ding, Wei
A2 - Lu, Chang-Tien
A2 - Wang, Fusheng
A2 - Di, Liping
A2 - Wu, Kesheng
A2 - Huan, Jun
A2 - Nambiar, Raghu
A2 - Li, Jundong
A2 - Ilievski, Filip
A2 - Baeza-Yates, Ricardo
A2 - Hu, Xiaohua
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 IEEE International Conference on Big Data, BigData 2024
Y2 - 15 December 2024 through 18 December 2024
ER -