基于水利一张图的地理空间信息问答智能体技术

    Research on large model agent technology of geospatial information question answering based on water conservancy map

    • 摘要: 当前,生成式人工智能技术,特别是自然语言大模型的兴起,为地理空间信息获取提供了新的思路。然而,现有的大模型和搭载大模型的智能体技术,主要基于通用数据集上训练的,当应用于在地理空间信息上的问答时,容易出现幻觉,存在回答内容相关度较低、回答不准确、缺乏实时性等问题。针对该问题,本文提出基于水利一张图的地理空间信息问答大模型智能体技术框架(GLMA),该技术框架主要利用智能体,理解用户的自然语言提问,驱动大模型执行水利一张图任务,再利用大模型融合提问和任务的返回结果,生成最终的提问回复内容。为了提高任务执行的准确性与有效性,在任务分配阶段,本文使用了树形的任务分配结构,有效地提高了任务检索和参数生成的能力。此外,为了验证GLMA的有效性,本文构建了一套地理空间信息问答数据集,并设定了相应的评估指标。在与最新的中文开源大模型Baichuan2、Llama3.1、ChatGLM4、Qwen2.5等的对比测试中,GLMA在任务分配准确率和查询结果准确率等评估指标上取得了最好的效果。本研究具有一定的扩展性,将为其它业务领域的大模型智能体研究奠定基础。

       

      Abstract: At present, generative artificial intelligence technology represented by natural language large model has ushered in vigorous development, providing a new way to obtain geospatial information. However, existing large models and agents equipped with large models, most of which are trained on general datasets, are highly prone to issues such as hallucinations, such as low correlation of answer content, inaccurate answer and lack of real-time performance. To address these issues, this paper proposes a technical framework for a large language model intelligent agent for geospatial information Q&A based on “one map” of water conservancy (GLMA). This technical framework integrates the agent's natural language understanding capabilities, enabling it to accurately grasp the user's question intent and subsequently drive the large model to execute specific "Water Resources Single Map" tasks. To enhance the accuracy of the task allocating, a tree-like structure task allocation system is used in the task dispatch stage, significantly improving the capabilities for task retrieval and parameter generation. Additionally, to validate the effectiveness of GLMA, this paper constructs a dataset for geospatial information question answering and establishes corresponding evaluation metrics. Compared with state-of-the-art Chinese open-source models such as Baichuan2, Llama3.1, ChatGLM4, and Qwen2.5, GLMA achieves the best results in terms of task allocation accuracy and query result accuracy. This research possesses significant extensibility and will lay the foundation for further studies on large model agents in other business domains.

       

    /

    返回文章
    返回