Skip to content

GhosanandaW/CS513

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 

Repository files navigation

CS513

Overview

This repository shows the overall data cleaning process and provenance of NYPL data for the U1 analysis

U1 target (main) use case: Analysis of the evolution of prominence (in terms of image size compared to menu size) and its correlation to price of specific dishes across different decades in New York.

Description

The goal of this analysis is to understand how the prominence of specific dishes, as indicated by the image size relative to the total menu area, has evolved over different decades in New York and how it affected the price. This involves examining the changes in how dishes are visually represented on menus and identifying trends in menu design and dish popularity over time and its effect on pricing.


Queries

  1. How has the average image size of popular dishes relative to the total menu size changed over each decade from the 1850s to the 2000s in New York?
  2. How do the image sizes of different dishes compare on menus from different decades?
  3. Are there particular decades where certain dishes became more visually prominent compared to others?
  4. How has price affected (independent variable) based on evolution of popular dishes and its menu image size (dependent variable) in the decade?

Tech stack used

  1. OpenRefine
  2. YesWorkflow and or2yw
  3. Logica
  4. DuckDB and SQL

Overall outer workflow

alt text

About

Repository for NYPL data cleaning

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published