Exploratory Data Analysis (EDA) | Data Analysis on 311, New York Service Request Calls


Import the required modules and load the data into a pandas DataFrame.
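As a minimal sketch of this loading step: the snippet below reads a tiny in-memory stand-in for the 311 export so it runs anywhere. The column names (`Unique Key`, `Created Date`, `Complaint Type`, `City`) and the sample rows are assumptions for illustration, not the actual file used in this post.

```python
import io

import pandas as pd

# Hypothetical three-row stand-in for the 311 CSV export.
# Column names and values are assumptions for illustration only.
sample_csv = io.StringIO(
    "Unique Key,Created Date,Complaint Type,City\n"
    "1,01/05/2015 10:15:00 AM,Noise,BROOKLYN\n"
    "2,01/06/2015 11:30:00 PM,Illegal Parking,BRONX\n"
    "3,01/06/2015 08:00:00 AM,Noise,BROOKLYN\n"
)

# With the real data, pass the path of the CSV file to pd.read_csv instead.
df = pd.read_csv(sample_csv)

print(df.shape)   # (number of rows, number of columns)
print(df.head())  # first few observations
```

On the full dataset, `df.shape` is what reports the number of observations and features.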
The dataset has 57 features (columns) and 5000 observations, so we keep only the relevant features and drop the unwanted columns.
We then check for null values and count the occurrences of each complaint type in the dataset.
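The cleaning steps could look roughly like this. The sample frame, the column names and the choice of `School Name` as an "unwanted" column are assumptions made for the sketch. Which columns to drop from the real data is a judgment call.

```python
import pandas as pd

# Hypothetical sample; the real dataset has many more columns and rows.
df = pd.DataFrame({
    "Unique Key": [1, 2, 3, 4],
    "Complaint Type": ["Noise", "Illegal Parking", "Noise", None],
    "City": ["BROOKLYN", "BRONX", None, "BROOKLYN"],
    "School Name": ["Unspecified"] * 4,
})

# Drop columns that are not needed for the analysis (assumed list).
unwanted = ["School Name"]
df = df.drop(columns=unwanted)

# Number of missing values in each remaining column.
null_counts = df.isnull().sum()
print(null_counts)

# Number of requests per complaint type (missing values are excluded).
complaint_counts = df["Complaint Type"].value_counts()
print(complaint_counts)
```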
Next, we plot a pie chart and a bar chart of the complaint counts.
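One possible way to draw both charts side by side with pandas and matplotlib is sketched here. The complaint values are made up, and the `Agg` backend is selected only so the snippet runs without a display.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; omit this in a notebook
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical complaint column standing in for the real data.
complaints = pd.Series(
    ["Noise", "Noise", "Noise", "Illegal Parking", "Illegal Parking", "Heating"],
    name="Complaint Type",
)
counts = complaints.value_counts()

fig, (ax_pie, ax_bar) = plt.subplots(1, 2, figsize=(12, 5))

# Pie chart: share of each complaint type.
counts.plot.pie(ax=ax_pie, autopct="%1.1f%%")
ax_pie.set_ylabel("")

# Bar chart: absolute number of requests per complaint type.
counts.plot.bar(ax=ax_bar)
ax_bar.set_xlabel("Complaint type")
ax_bar.set_ylabel("Number of requests")

fig.tight_layout()
```

In an interactive session, calling `plt.show()` afterwards displays the figure.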
Using create_date, we derive a new day-of-week column with a small function, then plot a time series of the complaints raised on each day of the week.
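A sketch of the day-of-week derivation follows. It assumes the creation timestamp lives in a column named `Created Date` with a month/day/year 12-hour format. The helper `day_of_week` and the sample dates are illustrative, not the exact function used in this post.

```python
import pandas as pd

# Hypothetical creation timestamps; column name and format are assumptions.
df = pd.DataFrame({
    "Created Date": [
        "01/05/2015 10:15:00 AM",
        "01/06/2015 11:30:00 PM",
        "01/06/2015 08:00:00 AM",
        "01/10/2015 09:45:00 AM",
    ]
})
df["Created Date"] = pd.to_datetime(
    df["Created Date"], format="%m/%d/%Y %I:%M:%S %p"
)


def day_of_week(timestamp):
    """Return the weekday name (e.g. 'Monday') for a pandas Timestamp."""
    return timestamp.day_name()


df["Day"] = df["Created Date"].apply(day_of_week)

# Count complaints per weekday, keeping calendar order rather than count order.
day_order = ["Monday", "Tuesday", "Wednesday", "Thursday",
             "Friday", "Saturday", "Sunday"]
per_day = df["Day"].value_counts().reindex(day_order, fill_value=0)
print(per_day)
```

Calling `per_day.plot(kind="line", marker="o")` would then draw the per-day series. Reindexing by `day_order` keeps the x-axis in calendar order.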
Now we explore City versus Complaint by grouping the data by city and complaint.
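The grouping might be written along these lines. The `City` and `Complaint Type` column names and the sample rows are assumptions for the sketch.

```python
import pandas as pd

# Hypothetical sample of city / complaint pairs.
df = pd.DataFrame({
    "City": ["BROOKLYN", "BROOKLYN", "BRONX", "BRONX", "BROOKLYN"],
    "Complaint Type": ["Noise", "Noise", "Illegal Parking",
                       "Noise", "Illegal Parking"],
})

# One row per (city, complaint) pair with the number of requests.
city_complaints = (
    df.groupby(["City", "Complaint Type"])
      .size()
      .reset_index(name="Count")
)
print(city_complaints)
```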
We plot the grouped counts as a bar chart.
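For the bar chart, one option is a grouped bar plot built from a city-by-complaint table. This sketch uses `pd.crosstab` on made-up data; the original chart may have been produced differently.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; omit this in a notebook
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical sample of city / complaint pairs.
df = pd.DataFrame({
    "City": ["BROOKLYN", "BROOKLYN", "BRONX", "BRONX", "BROOKLYN"],
    "Complaint Type": ["Noise", "Noise", "Illegal Parking",
                       "Noise", "Illegal Parking"],
})

# Rows are cities, columns are complaint types, cells are request counts.
table = pd.crosstab(df["City"], df["Complaint Type"])

# One group of bars per city, one bar per complaint type.
ax = table.plot(kind="bar", figsize=(10, 5))
ax.set_xlabel("City")
ax.set_ylabel("Number of requests")
ax.legend(title="Complaint Type")
plt.tight_layout()
```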
Finally, we group by Day, City and Complaint and plot charts of the result.
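A rough sketch of the three-way grouping is given here, assuming a `Day` column has already been derived and using made-up rows.

```python
import pandas as pd

# Hypothetical sample with a precomputed Day column.
df = pd.DataFrame({
    "Day": ["Monday", "Monday", "Monday", "Tuesday", "Tuesday"],
    "City": ["BROOKLYN", "BROOKLYN", "BRONX", "BRONX", "BROOKLYN"],
    "Complaint Type": ["Noise", "Noise", "Illegal Parking",
                       "Noise", "Noise"],
})

# Number of requests for each (day, city, complaint) combination.
grouped = df.groupby(["Day", "City", "Complaint Type"]).size()
print(grouped)

# Wide layout: one row per (day, city), one column per complaint type.
per_day_city = grouped.unstack("Complaint Type", fill_value=0)
print(per_day_city)
```

From here, `per_day_city.plot(kind="bar", stacked=True)` is one way to chart the result, with each bar showing the complaint mix for a day and city.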
