Automated Extraction and Characterisation of Social Network Data from Unstructured Sources – An Ontology-Based Approach

PDF

Authors
  1. Martineau, E.
  2. Lecocq, R.
Corporate Authors
Defence Research and Development Canada, Valcartier Research Centre, Quebec QC (CAN)
Abstract
Automated extraction of social network related data is one objective of the applied research project on SNA in Counter]Insurgency context ( SNAC) at DRDC Valcartier. Since the vast majority of the information resides in unstructured text documents, the prototype must be able to extract social network related data directly from them. For these tasks, the prototype leverages and refines existing services provided by the Intelligence Science & Technology Integration Platform (ISTIP) at DRDC Valcartier. These services rely on ontologies to perform document annotation and to semantically characterise the data to be persisted. Given a list of a priory known instances of entities like people, organizations, and events, the system constructs the social web that ties these entities together. To do so, on a continuous basis, documents are fed via a data source crawling service that scans existing databases and returns new documents. Then, using natural language processing services, the system scans these incoming documents extracts and persists in a graph database information about entities as well as their relations and their respective attributes. The system also provides basic and semantic filtering services as well as conversion to many formats.
Report Number
DRDC-RDDC-2014-P5 — External Literature
Date of publication
18 Mar 2014
Number of Pages
12
DSTKIM No
CA038384
CANDIS No
538638
Format(s):
Electronic Document(PDF)

Permanent link

Document 1 of 1

Date modified: