A Big Data Application built with
- Over 7.4 Million data records covering 10-year quarterly US real estate market transaction information at zipcode level (Downloaded from Redfin Data Portal);
- Over 42 Thousand data records covering all US zipcode and corresponding primary city information.
Download Data and Ingest to AWS EMR Cluster
HDFS File Structure:
./yvesyang
/zipcode
/zipcode_city.csv
/new_2023_data
/zip_market_after_2023.csv
/history_data
/zip_market_before_2023.csv
Launch Hive in EMR: beeline -u jdbc:hive2://localhost:10000/default -n hadoop -d org.apache.hive.jdbc.HiveDriver
Create Hive Table from raw source csv files
- yvesyang_before2023
- yvesyang_after2023
- yvesyang_zipcode
Connect US zipcode data to US Housing Market data
- yvesyang_combined_before2023
- yvesyang_combined_after2023
Web UI Add generative map in the web ui?