Ontology-Driven Data Semantics Discovery for Cyber-Security

8 years 7 months ago

Download sarahkushner.com

Abstract. We present an architecture for data semantics discovery capable of extracting semantically-rich content from human-readable files without prior specification of the file format. The architecture, based on work at the intersection of knowledge representation and machine learning, includes machine learning modules for automatic file format identification, tokenization, and entity identification. The process is driven by an ontology of domain-specific concepts. The ontology also provides an ion layer for querying the extracted data. We provide a general description of the architecture as well as details of the current implementation. Although the architecture can be applied in a variety of domains, we focus on cyber-forensics applications, aiming to allow one to parse data sources, such as log files, for which there are no readily-available parsing and analysis tools, and to aggregate and query data from multiple, diverse systems across large networks. The key contributions of o...

Marcello Balduccini, Sarah Kushner, Jacquelin Spec

Real-time Traffic

PADL 2015 | Programming Languages |

claim paper

Post Info
More Details (n/a)

Added	16 Apr 2016
Updated	16 Apr 2016
Type	Journal
Year	2015
Where	PADL
Authors	Marcello Balduccini, Sarah Kushner, Jacquelin Speck

Comments (0)

Sciweavers

Ontology-Driven Data Semantics Discovery for Cyber-Security

PADL 2015 | Programming Languages |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers