Keeping it Fresh: Predict restaurant inspections
Yelp Data Set
The Yelp dataset released for the academic challenge contains information on 11,537 businesses, together with 8,282 check-in sets, 43,873 users, and 229,907 reviews for those businesses. Because this study concerns only restaurant data, we consider only those businesses that are categorized as food or restaurants. The two data files used are:
1. yelp_academic_dataset_business.json
{ 'type': 'business',
'business_id': (business id),
'name': (business name),
'neighborhoods': [(hood names)],
'full_address': (localized address),
'city': (city),
'state': (state),
'latitude': latitude,
'longitude': longitude,
'stars': (star rating, rounded to half-stars),
'review_count': review count,
'categories': [(localized category names)],
...
}
2. yelp_academic_dataset_reviews.json
{ 'type': 'review',
'business_id': (business id),
'user_id': (user id),
'stars': (star rating, rounded to half-stars),
'text': (review text),
'date': (date, formatted like '2012-03-14'),
'votes': {(vote type): (count)}, }
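The restaurant filter described above can be sketched as follows. This is a minimal illustration, not the exact code used in the study: it assumes each file stores one JSON object per line, and the category labels in `FOOD_CATEGORIES` as well as the helper names `load_records`, `filter_restaurants`, and `reviews_for` are hypothetical.

```python
import json

# Assumption: the labels that mark a business as food-related.
# The exact category strings used in the study are not given in the text.
FOOD_CATEGORIES = {"Food", "Restaurants"}


def load_records(lines):
    """Parse an iterable of JSON lines into dicts, skipping blank lines."""
    return [json.loads(line) for line in lines if line.strip()]


def filter_restaurants(businesses, wanted=FOOD_CATEGORIES):
    """Keep businesses that carry at least one of the wanted categories."""
    return [
        b for b in businesses
        if set(wanted).intersection(b.get("categories") or [])
    ]


def reviews_for(reviews, businesses):
    """Keep only the reviews that belong to the given businesses."""
    ids = {b["business_id"] for b in businesses}
    return [r for r in reviews if r["business_id"] in ids]
```

With the real files, `load_records` would be given an open file handle, for example `load_records(open("yelp_academic_dataset_business.json"))`.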
Boston Data Set
The Boston health data records provide the IDs of restaurants in the City of Boston, along with the date of each inspection and the number of violations found.
The number of violations is recorded in three separate attributes, one per severity level:
* -- Minor Violation
** -- Major Violation
*** -- Severe Violation
A total of 25,603 inspections were conducted between 2006 and 2015.
The City of Boston regularly conducts restaurant inspections to ensure food safety and compliance with public health rules. It records health violations for every restaurant at three levels: * (one star) "minor", ** (two stars) "major", and *** (three stars) "severe". Currently, health inspections are random, which wastes time and effort on clean restaurants that have been following the rules closely and misses opportunities to improve health and hygiene at places with more serious food safety issues.
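One way to hold an inspection record with its three violation counts is sketched below. The class `Inspection`, its field names, and the helper `totals_by_restaurant` are hypothetical; the actual column names in the Boston data are not given here.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Inspection:
    """One inspection of one restaurant (field names are assumptions)."""
    restaurant_id: str
    inspected_on: date
    minor: int   # number of * violations
    major: int   # number of ** violations
    severe: int  # number of *** violations


def totals_by_restaurant(inspections):
    """Sum the (minor, major, severe) counts over all inspections per restaurant."""
    totals = defaultdict(lambda: [0, 0, 0])
    for insp in inspections:
        counts = totals[insp.restaurant_id]
        counts[0] += insp.minor
        counts[1] += insp.major
        counts[2] += insp.severe
    return {rid: tuple(counts) for rid, counts in totals.items()}
```

Keeping the three levels as separate counts, rather than collapsing them into a single number, preserves the distinction the city makes between minor, major, and severe violations.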
Driven Data
DrivenData provided the mapping file between the Yelp restaurant IDs and the Boston data.
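A minimal sketch of applying such a mapping is shown below. It assumes the mapping has already been read into a dict keyed by Yelp business ID; the function name `attach_boston_id` and the `boston_id` key are hypothetical, and the layout of the real mapping file is not described here.

```python
def attach_boston_id(businesses, yelp_to_boston):
    """Return copies of the Yelp business records that have a Boston ID.

    businesses: list of dicts, each with a 'business_id' key (Yelp schema).
    yelp_to_boston: dict mapping Yelp business ID -> Boston restaurant ID.
    Businesses without an entry in the mapping are dropped.
    """
    matched = []
    for business in businesses:
        boston_id = yelp_to_boston.get(business["business_id"])
        if boston_id is not None:
            # Copy the record so the input list is left unchanged.
            matched.append({**business, "boston_id": boston_id})
    return matched
```

Once each Yelp record carries a Boston ID, its reviews can be lined up with the inspection history of the same restaurant.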