News
- [2022.04.27] NEW!!! Submission website for UIE@CCKS2022 competition is open, welcome to submit your solutions!
- [2022.04.21] Our code are public released.
- [2022.04.12] Launch universal information extraction competition @CCKS2022.
- [2022.02.24] UIE accepted by ACL 2022, long paper.
Unified Structure Generation for Universal Information Extraction
Information extraction suffers from its varying targets, heterogeneous structures, and demand-specific schemas. In this paper, we propose a unified text-to-structure generation framework, namely UIE, which can universally model different IE tasks, adaptively generate targeted structures, and collaboratively learn general IE abilities from different knowledge sources. Specifically, UIE uniformly encodes different extraction structures via a structured extraction language, adaptively generates target extractions via a schema-based prompt mechanism – structural schema instructor, and captures the common IE abilities via a large-scale pretrained text-to-structure model. Experiments show that UIE achieved the state-of-the-art performance on 4 IE tasks, 13 datasets, and on all supervised, low-resource, and few-shot settings for a wide range of entity, relation, event and sentiment extraction tasks and their unification. These results verified the effectiveness, universality, and transferability of UIE.
You can find code at Github and the pre-trained models as following CAS Cloud Box/Google Drive links.
uie-en-base [CAS Cloud Box] [Google Drive] [Huggingface]
uie-en-large [CAS Cloud Box] [Google Drive] [Huggingface]
Publications
- Unified Structure Generation for Universal Information Extraction. Yaojie Lu, Qing Liu, Dai Dai, Xinyan Xiao, Hongyu Lin, Xianpei Han, Le Sun and Hua Wu. ACL 2022. [link] [arxiv] [Code in PyTorch] [Code in Paddle]
Competition
- Universal Information Extraction @CCKS 2022 [website] [Baseline in Paddle]
Other Implementations
- [PaddleNLP] UIE-Pointer: A pointer-based solution for universal information extraction
This is a pointer-based solution for universal information extraction implemented by Baidu and ISCAS. This implementation is with the same intuition as UIE, but it replaces the T5-based generation model with an ERNIE-based pointer model and further fine-tunes on public available IE datasets.
Last updated: Sep 23rd, 2022