Project description
Internet is evolving rapidly, being present in every aspect of our lives. Today’s society except from using Internet for fun or work, has also to handle multiple cyber threats. The major objective of the SMC project, that is financed by the National Centre for Research and Development (NCBiR) (contact no. 0079/R/T00/2010/11), is to create methods and a prototype that enable integration of information and data from various sources, in order to provide means for automatic detection of content-related cyber threats.
Characteristics of methods and a prototype being developed
- Sources of information of the SMC system are public content (e.g. articles and comments on web pages, posts on discussions forums, ads) from the shallow and the deep Web. Project includes processing of natural language unstructured data.
- Data being processing comes from heterogeneous, structured and unstructured sources.
- Information sources are constantly monitored to detect irregularities that may be effect of occurred threat.
- Information about a certain class of a threat is stored in threat metaprofiles, that enable filtering of information from sources being monitored.
- Threat profiles are built and evolve according to rules defined by experts.
- Domain experts contribution is necessary only while defining rules, not during the process of monitoring for certain threat class.
Unique features
- Information extraction from chosen Internet sources, like social network sites or on-line auction services.
- Integrating data from diverse sources, from public web sites as well as from internal databases.
- Automatic detection of content-related cyber threats in monitored sources.
- A threat is identified based on rules defined by experts. Thus proposed solution can be used to detect different threat types.
Components