Sciweavers

PADL
2015
Springer

Ontology-Driven Data Semantics Discovery for Cyber-Security

Abstract. We present an architecture for data semantics discovery capable of extracting semantically-rich content from human-readable files without prior specification of the file format. The architecture, based on work at the intersection of knowledge representation and machine learning, includes machine learning modules for automatic file format identification, tokenization, and entity identification. The process is driven by an ontology of domain-specific concepts. The ontology also provides an abstraction layer for querying the extracted data. We provide a general description of the architecture as well as details of the current implementation. Although the architecture can be applied in a variety of domains, we focus on cyber-forensics applications, aiming to allow one to parse data sources, such as log files, for which there are no readily-available parsing and analysis tools, and to aggregate and query data from multiple, diverse systems across large networks. The key contributions of o...
Marcello Balduccini, Sarah Kushner, Jacquelin Speck
Added 16 Apr 2016
Updated 16 Apr 2016
Type Journal
Year 2015
Where PADL
Authors Marcello Balduccini, Sarah Kushner, Jacquelin Speck