Open and Closed Data Sources: Documents in different formats, TV, radio, and information held in closed legacy systems are the data sources to be mined and evaluated by CAPER. In addition to general Internet data sources, CAPER will also integrate mass media, internal LEA information systems, and access to Semantic Web data collections.
The CAPER project will be designed from a linguistically neutral point of view. This design methodology will allow linguistic analysis and speech recognition components for any language to be added in the future. The initial languages to be supported by CAPER are listed in WP 5.2; this list has been agreed with the participating LEAs. CAPER will be capable of acquiring content from multiple sources, in multiple formats, and in several languages. LEA users will provide reference images, keywords, and biometric data, and will define the concepts to be used in information acquisition. For mass media capture, channel information will be required.
A key goal of the CAPER system will be to make like-for-like comparisons between different information types (text, image, voice, etc.) and between data with different cultural biases. This goes beyond data or web mining and will require tailorable algorithms that can make a broad classification of multiple data sources or of a specific piece of atomic content.
The CAPER project will contain a Visual Analytics (VA) component that will allow interactive decision making and hypothesis building. The VA work package will also include the development of mining and inference modules with a rich graphical interface for representing and interacting with the evaluated information. It will allow both closed and open intelligence to be compared and contrasted. Visual Analytics has been successfully applied in the banking sector for anti-money laundering applications and credit card fraud detection, and in IP protection services to detect trademark abuse and defamation.
Standardisation is essential for interchange of data and tools. Once a data format is accepted as a standard, tools can be developed and shared with little data conversion effort.
KAF has been developed in the KYOTO project, an EU co-funded project (FP7 ICT Work Programme 2007, Digital Libraries and Content, Intelligent Content and Semantics objective).
Best-of-class ETL (Extraction, Transformation and Load) tools provided by partner ALTIC will ensure that the system can be integrated with legacy information systems. Additional components, such as IBM ICU (International Components for Unicode), will provide the technologies required for consistent and correct conversion of locale-sensitive data between system components. Finally, the CAPER system will be built on industry-standard and open technologies and fitted with both a Web Services layer and traditional Java/C APIs.
The common management and workflow application and the analysts’ workbenches will be developed in compliance with prevailing secure systems standards and certification to guarantee intrinsic security. Its design will also address both national and European legal concerns. More importantly, this layer will also use Semantic technologies to enhance the semantic interoperability of the system, ensuring that LEA users will have a common language and set of concepts when using the system.
The project contains a specific work package in which legal, ethical and societal issues will be identified (WP7). In collaboration with participating LEAs, these will be addressed in the systems design and operational phases of the project. The system architecture foreseen is compartmentalised and will provide sufficient flexibility to react to any legal barriers that might arise during the execution of the project. The project will also produce policy and best practice recommendations based on this legal study.
The project will be structured specifically around the needs of the participating Law Enforcement Agencies. Two work packages exist to achieve this goal: WP2, which will allow the LEAs to analyse and define their requirements, and WP8, which will allow the participating LEAs to perform integration tests and field trials of the CAPER platform in partnership with the wider consortium. The project concept has in part been shaped by the fact that several members of the project consortium are drawn from different disciplines and have existing experience in data security, Open Source Intelligence, data mining, content analysis and linguistics. The CAPER project strategy will be to engage the end users at the beginning of the project. They will first be shown what technologies are available to them; then, in cooperation with the CAPER consortium, the constituent components will be integrated and further developed to meet the LEAs’ needs.
To achieve these goals, the participating LEAs will