Skip to main content

Detecting Vulnerable regions in metropolitan cities

Introduction

The problem is to handle the growing violence rate by estimating the probability of the upcoming violence, especially in metropolitan cities.

Why is the problem important?

This is important since if by doing so, we could somehow able to stop even 10-15% of upcoming threat then it can have a vast effect.

Who will benefit :

Police can analyze data in real time and may increase patrolling if required.

Based on available data, police can effectively maintain law and order in vulnerable areas.

Our strategy

For this we chose the social media platform twitter

1) First of all we collected tweets with geo tagged locations for the last 7 days for 4 citites hyderabad, mumbai, kolkata and delhi

2) But only 2% of total tweets have geo tagged locations.

So what we have done is that, we made a dictionary of areas of these cities from maps of india and find  the location if it is mentioned in the tweet
like My bag is stolen from CP Delhi.

Third thing is that twitter provide user location field like i m from delhi but tweeting from hyderabad then also that location is useful somtimes.

After getting locations, We performed sentiment analysis using Crowdflower dataset as training dataset and our collected tweets as test data, in which there are 3 categories, harmful, non harmful and normal.

4) After classifying all tweets, we plotted the heatmap using Google maps api to show red regions as per harmfulness of tweets




Why location is important to us? 

To find vulnerable areas our first priority is Geo-tagged location. If there is a tagged location in tweet then we are done with the location if not we choose alternatives like location mentioned in tweet, or user location. How the hell is user location is important? Suppose your home is in Delhi but you are tweeting from Hyderabad, about a riot in a your home location in Delhi. So if user will search some kind of tweet, with a specific keyword, user location might be useful to analyse a tweet.


Application Interface :


User can search tweet based on location, keyword or date.

We will show total number of tweets, neutral tweets etc.











References : 




Comments

Popular posts from this blog

InstaBully

Introduction Cyber bullying has become prevalent in today's social media driven world. Awareness about it however, is not very widespread. Given that there is usually no escape for cyber bullying victims from their bullies, it is even more devastating than traditional bullying. Sometimes it is also hard to distinguish between simple negative interactions and cyber-bullying. Keeping this in mind we wanted to create a program that would help detect cyber bullying on Instagram accounts given only a username. Relevance In India, nearly 40% of people have never heard of cyber-bullying. Furthermore a majority of people think that current cyber-bullying measures are insufficient. 45% of parents say that their children have been cyber-bullied. Out of all the various ways in which people can be bullied online social media is the most common and also the most personal.  Although the nature of the bullying changes from platform to platform the effect does not change. we picked...

Traffic Violations in Metropolitan Cities

Introduction With the advent of the smartphone era and the availability of 4G internet across the country, police forces have begun to use electronic receipts of the traditional traffic challans. E-Challans are electronically generated penalty receipt that takes the place of the physical paper receipts and helps in digitizing the whole process of collecting challans and penalizing violations. In this project, we analyze the set of all unpaid E-Challans collected in metropolitan cities over a large span of time to gain unique insights about the nature of traffic violations in such cities. The problem is very relevant for a course on Big Data & Policing as it tries to answer the following important questions: How are traffic violations distributed spatially and temporally across the city boundaries? Can the most common violation types be characterized and be used for providing intervention insights? How can police leverage social media for increasing awareness and for targe...

Human Trafficking dataset creation & analysis

Introduction The goal of this project is to create a Human Trafficking dataset from reliable sources such as news articles, Government agencies, etc and analyse the pain points in this area. Motivation   What is human trafficking? Human trafficking involves recruitment, harbouring or transporting people into a situation of exploitation through the use of violence, deception or coercion and forced to work against their will. In other words, trafficking is a process of enslaving people, coercing them into a situation with no way out, and exploiting them. What is it important?   Did you know that in 2015 alone, Human Trafficking generated $150 billion, more revenue  than Google, Nike, The NFL and Starbucks combined ?!?!   Sounds crazy right? Well there is more to this story than you know, that's why 18th of October is the EU Anti-Trafficking Day.According to a September 2017 report from the International Labor Organization (ILO) and Walk Free Fo...