Alessandro Seganti

Industry

VAT Fraud detection with streaming reasoning

I will be presenting the fraud detection system based on the Cognitum Platform that we have implemented for a state in Brazil. The system is a rule based system that is analyzing a stream of bills in real time. The system is currently helping the fraud analyst of the ministry of finance to recognize quicker and more effectively VAT frauds. Our system is a combination of formal knowledge representation (using SWRL rules) and a statistical engine combined together. During my presentation I will highlight the challenges found during the implementation and the first time in production of the system and the lessons learned from our mistakes.

Detection of large-scale fraudulent activities requires heavy usage of the online (real-time) data analysis. It requires complicated, and time-consuming investigations and deals with various domains of knowledge like financial, economics, business practices and law. Nevertheless, the real challenge is to build adaptive and self-learning fraud detection system, as it needs special methods of intelligent data analysis to detect and prevent losses.
On the one hand, desired fraud detection system must be able to deal with gargantuan computational complexity. It needs to recognise complex patterns over time periods spanning seconds to months. It also must be easily customizable and readily maintainable by specialists in frequently changing business environment. Ensuring compliance and finding fraud requires monitoring millions of daily transactions in real time and not error-prone invoice data which complicates processing and analysis. On the other hand, auditable proof of non-compliance is critical to tax enforcement and needs to be provided by the system too.
A state in Brazil dealt with revenue loss on VAT estimated at the level of 80M USD per year in retail only.
The semantical reasoning engine of the Fraud Detection System recognises complex fraud and non-compliance patterns. Natural language rules enable decision makers and specialists to manage and maintain tax fraud knowledge base by themselves, with an only sporadic support of programmers. Situation awareness is provided by operation dashboard that combines data visualisations and reports to give operators and management comprehensive situational awareness of fraud detection analysis and prevention.
I will be presenting the fraud detection system based on the Cognitum Platform that we have implemented for a state in Brazil. The system is a rule based system that is analyzing a stream of bills in real time. The system is currently helping the fraud analyst of the ministry of finance to recognize quicker and more effectively VAT frauds. Our system is a combination of formal knowledge representation (using SWRL rules) and a statistical engine combined together. During my presentation I will highlight the challenges found during the implementation and the first time in production of the system and the lessons learned from our mistakes.

CV

I am Scrum Product Owner in all projects in the company: managing a team of 8 programmers and developing new products and functionalities for products used by more than 4000 users and by governments.

I am the main contact for key international clients and I help the company in winning new business.

I published several scientific articles on our work and regurarly representing the company at international conferences (CNL2015, CNL2016, OWLED2015, Semantics2015, Semantics2016).

I am the main architect of Ask Data Anything, a product to integrate all kind of data sources using semantic technologies: creating a natural language parser using NLP libraries, mapping natural language to complex SQL queries, architecting a modular solution for online analytics using F#.

I build scalable and highly reliable cloud based applications using C#, Windows Azure, WCF services and NoSql databases. We also use extensively actor based programming (e.g. Akka.Net).
All programs that I develop use semantic technologies standards as OWL, RDF, SPARQL. For this reason I have knowledge of the theory behind these technologies (Description Logic, Horn Logic, SWRL).

I program in Java and Scala and I use (and contribute) to open source projects that need to be integrated in our system (OWLAPI, TinkerPop blueprints). I am a committer in the Jena Triple Store.

I am Developing desktop and web applications in the Windows environment using C#, F#, WPF, SQL, WCF, ASP.NET (Core, WebApi, Identity Core), WIX installer, Javascript (Nodejs, React, Redux, Webpack,...) and Html5.

I have a deep knowledge of NoSql (Datastax Cassandra) and Graph databases (Titan, Virtuoso, Sesame, Neo4J,..).