欢迎访问《高校地质学报》官方网站,今天是
分享到:

高校地质学报 ›› 2023, Vol. 29 ›› Issue (3): 337-344.DOI: 10.16108/j.issn1006-7493.2023025

• 固体地球科学文本挖掘和知识图谱专栏 特邀主编:马 超 诸云强 闾海荣 胡修棉 • 上一篇    下一篇

基于注意力机制的孪生网络地质调查空间实体与文本信息匹配

邱芹军1,2*,马 凯3,4,谢 忠1,2,陶留锋1,2,黄 波5   

  1. 1. 中国地质大学(武汉)计算机学院,武汉 430074;
    2. 智能地学信息处理湖北省重点实验室,武汉 430074;
    3. 湖北省水电工程智能视觉监测重点实验室,宜昌 443002;
    4. 三峡大学 计算机与信息学院,宜昌 443002;
    5. 武汉中地数码科技有限公司,武汉 430074
  • 出版日期:2023-06-20 发布日期:2023-06-20

Spatial Entity-Text Information Matching for Geological Surveys Based on Siamese Network with Attention Mechanism

QIU Qinjun1,2*,MA Kai3,4,XIE Zhong1,2,TAO Liufeng1,2,HUANG Bo5   

  1. 1. School of Computer Sciences, China University of Geosciences, Wuhan 430074, China;
    2. Hubei Key Laboratory of Intelligent Geo-Information Processing, China University of Geosciences, Wuhan 430074, China;
    3. Hubei Key Laboratory of Intelligent Vision Based Monitoring for Hydroelectric Engineering, China Three Gorges University, Yichang 443002, China;
    4. College of Computer and Information Technology, China Three Gorges University, Yichang 443002, China;
    5. Wuhan Zondy Cyber Technology Ltd., Co., Wuhan 430074, China
  • Online:2023-06-20 Published:2023-06-20

摘要: 对来源不同的地质对象进行关联匹配,并通过模型对其结构、属性及语义关系进行表示是后期语义查询及聚类等任务的重要支撑。文章针对地质调查空间实体与外部文本描述语义异构、表达差异等问题,提出了一种基于注意力机制的孪生网络地质调查空间实体与文本描述信息关联匹配模型。首先,将地质调查空间实体的属性信息转换成为文本段落,以句向量基本粒度对地质空间实体进行文本语义编码;接着将两类文本对象映射到统一向量空间中,并输入到孪生网络中进行特征学习,最后在构建真实数据集上进行模型性能的实验测评。结果显示,该模型能够较好表示地质调查空间实体句子语义信息,其识别F1值相比基准实验提高了8.4个百分点,优于选取的对比方法。

关键词: 地质调查实体, 文本多语义表征, 信息匹配, 语义相似性

Abstract: Association matching of geological objects with different sources and representation of their structures, attributes and semantic relationships by models is an important support for later tasks such as semantic query and clustering. In this paper, we propose a twin network geological survey spatial entities and text description information association matching model based on attention mechanism for the problems of semantic heterogeneity and expression differences between geological survey spatial entities and external text descriptions. First, the attribute information of geological survey spatial entities is converted into text paragraphs, and the text semantics of geological spatial entities is encoded with the basic granularity of sentence vectors; then the two types of text objects are mapped into a unified vector space and input to the twin network for feature learning, and finally the experimental evaluation of model performance is conducted on the constructed real dataset. The results demonstrate that the model can better represent the sentence semantic information of geological survey spatial entities, and its recognition F1 value is improved by 8.4 percentage points compared with the benchmark experiment, which is better than the selected comparison method.

Key words: geological survey entities, textual semantic representation, information matching, semantic similarity

中图分类号: