Predicting neighborhoods' socioeconomic attributes using restaurant data

Proceedings of the National Academy of Sciences (PNAS)

by Lei Dong (MIT), Carlo Ratti (MIT), and Siqi Zheng (MIT)

Abstract

Accessing high-resolution, timely socioeconomic data such as data on population, employment, and enterprise activity at the neighborhood level is critical for social scientists and policy makers to design and implement location-based policies. However, in many developing countries or cities, reliable local-scale socioeconomic data remain scarce. Here, we show an easily accessible and timely updated location attribute—restaurant—can be used to accurately predict a range of socioeconomic attributes of urban neighborhoods. We merge restaurant data from an online platform with 3 microdatasets for 9 Chinese cities. Using features extracted from restaurants, we train machine-learning models to estimate daytime and nighttime population, number of firms, and consumption level at various spatial resolutions. The trained model can explain 90 to 95% of the variation of those attributes across neighborhoods in the test dataset. We analyze the tradeoff between accuracy, spatial resolution, and number of training samples, as well as the heterogeneity of the predicted results across different spatial locations, demographics, and firm industries. Finally, we demonstrate the cross-city generality of this method by training the model in one city and then applying it directly to other cities. The transferability of this restaurant model can help bridge data gaps between cities, allowing all cities to enjoy big data and algorithm dividends.

Web | Paper | Appendix | Slides

Replication data and code

restaurant_replication.R
- R code to replicate the results of the paper
data_dianping
- Dianping restaurant data of nine cities
- Baoding, Beijing, Chengdu, Hengyang, Kunming, Shenyang, Shenzhen, Yueyang, and Zhengzhou
rst
- Model training results
- Download
feature
- Feature for training
- Download

Contact: [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
data_dianping		data_dianping
README.md		README.md
Restaurant_Slides.pdf		Restaurant_Slides.pdf
food-small.jpg		food-small.jpg
restaurant_replication.R		restaurant_replication.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Predicting neighborhoods' socioeconomic attributes using restaurant data

Abstract

Replication data and code

About

Releases

Packages

Languages

leiii/restaurant

Folders and files

Latest commit

History

Repository files navigation

Predicting neighborhoods' socioeconomic attributes using restaurant data

Abstract

Replication data and code

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages