Using Machine Learning and NLP to automate Crime
1 / 1

Using Machine Learning and NLP to automate Crime

Author : mitsue-stanley | Published Date : 2025-05-17

Description: Using Machine Learning and NLP to automate Crime Survey for England and Wales CSEW offence coding Alessandra Sozzi and Shannan Greaney ONS Contents What is the CSEW The current offence coding process The purpose of the case study

Presentation Embed Code

Download Presentation

Download Presentation The PPT/PDF document "Using Machine Learning and NLP to automate Crime" is the property of its rightful owner. Permission is granted to download and print the materials on this website for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.

Transcript:Using Machine Learning and NLP to automate Crime:
Using Machine Learning and NLP to automate Crime Survey for England and Wales (CSEW) offence coding Alessandra Sozzi and Shannan Greaney, ONS Contents What is the CSEW? The current offence coding process The purpose of the case study OffenceCoder model Results Contents What is the CSEW? The current offence coding process The purpose of the case study OffenceCoder model Results What is the Crime Survey for England and Wales (CSEW)? The CSEW aims to measure the extent of various crimes experienced by the public. It asks respondents whether they have experienced crime in the last 12 months. Contents What is the CSEW? The current offence coding process The purpose of the case study OffenceCoder model Results Current process The Crime Statistics team at ONS ‘dual code’ 10% (approx. 2,000 VFs per year) to check that the external company who manage the CSEW is coding correctly. It’s 1 part-time person job for 1 year (in reality 2 EOs, 7 ROs and 7 SROs) On average it takes 10-15 mins per VF Ambiguous cases requires agreement of multiple persons in the team (a decision could take days and a sign-off by a G7) Coders have to choose one of the 50+ offence codes Ambiguous VFs might be: - if the VF features more than one crime (e.g. a burglar breaks into someone’s house, beats up the occupants, steals the car and breaks some valuable belongings). A priority order is used. - Duplicates: using the example above, the respondent (or interviewer) could record each of those crimes as separate VFs but because they belong to the same incident, one VF should have been completed and one offence code should be applied Example of current coding process Coders read through the free text and the closed responses. Please note, this is not a real VF. They have to follow written guidance and flow charts (8 in total) to reach an Offence code Contents What is the CSEW? The current offence coding process The purpose of the case study OffenceCoder model Results The purpose of the case study The purpose of the case study is to assess the feasibility of doing this automatically, using Natural Language Processing (NLP) and classification techniques. Machine learning: explores the study and construction of algorithms that can learn from and make predictions on data. We use 10 years of historic manually classified VFs to build a model that can predict

Download Document

Here is the link to download the presentation.
"Using Machine Learning and NLP to automate Crime"The content belongs to its owner. You may download and print it for personal use, without modification, and keep all copyright notices. By downloading, you agree to these terms.

Related Presentations

Using Image Processing Techniques to Automate Chess Game Recording Proceedings of the Regression Testing – What to Automate and how Regression Testing – What to Automate and how MEBI 591C/598 – Text Mining/NLP Natural Language Processing Programmatic NLP Introduction to NLP Classic parsing methods AutoLearn’s  authoring NLP Working Group Chunnan Hsu “Power platform has made it so easy to automate and manage processes and quantify business CS 384: Ethical and Social Issues in NLP Automate Everyday Tasks in Jira: A practical, no-code approach for Jira admins and power Relative Measurements: Crime Severity, Property Crime Rate, Violent Crime Rate Using deep machine learning to conduct object-based identification and motion detection