GATE In Brief

GATE provides open-source models and algorithms for Information Extraction (IE), social media analytics, machine learning for IE, knowledge graphs and semantic annotation, and Natural Language Processing as a service. In total, the infrastructure and all its open-source components and models comprise over 350,000 lines of code.

The GATE infrastructure is unique in offering both researchers and companies a comprehensive platform-as-a-service, GATE Cloud. In the past year it attracted over 290 registered users, who used the services over 37,000 times.

Large-scale Information Extraction (IE) is a particular strength: GATE is used as the platform to extract information from the web, news wires, scientific papers, and legal and medical documents. Selected example uses include: the development of robust, scalable IE from patents; use by the BBC in 2012 to automate coverage of the Olympic Games through semantic annotation; and use by WHO epidemiologists, who found the first gene-disease association linking lung cancer and smoking. Other examples in the UK include: Garlik (from the founders of Egg Plc), to fight identity theft; Innovantage, for intelligent recruiting; Fizzback (now NICE), for analysing customer feedback; the UK National Archives; the Press Association; the Financial Times; the Stationery Office; Nesta; TechCity UK; Synaptica; Text Mining Solutions; Buzzfeed UK; and Public Health England.

