This project was done in the context of EPFL | Media & Design Lab | Personal Interaction Studio 2018
Course Tutors:
- Immanuel Koh
- Jeffrey Huang
In this repository, we collect data from multiple sources regarding renewal areas in Taipei City starting year 2012.
Our data is obtained thanks to
- Taipei Historical Maps
- 地號 GeoJSON API for GeoJson Files
- Taipei Urabn Development Department for Illegal Rooftops
- Taipei Urban Regeneration Office for Renewal Areas
- Taiwan National Development Council for Land Prices
- In the cloned directory, use
python -m http.server
to run the server http://localhost:PORT_NUM/Website/index.html
, and here it goes!
In this folder, we organize our obtained and cleaned data.
The final result concatenating everything is Final.csv
Particular files of interest:
Land Prices 201X.csv
: Land price data for year 201XOutput.csv
: List of Illegal Rooftops and their coordinates
This folder contains jupyter notebook files where we experimented with different methods.
GreenFinder.ipynb
: This notebook shows how we calculate the vegetation indicesTSNE.ipynb
: This notebook shows how we built our TSNE posters
N.B.: The images are not uploaded to github due to their number and size. They can be recovered by using Scripts\SnapRenewalAreas
in QGIS
This folder contains .tex files we used to generate our report for the course
This folder contains the scripts used to download and parse renewal areas data. It also contains the data in different subfolders:
GeoJson
: GeoJson data files for each renewal areaLandNumber
: Subdivisions of each renewal areaText
: Raw uncleaned text obtained after OCR on the scans
Data was collected with the following pipeline:
Crawler/Renewal Pdf Crawler.ipynb
: Retrieves renewal pdfs from Taipei Urban Regeneration Officeextract.py
: Takes only the first few pages from the raw pdf to speed up the OCR processOCR.py
: Gets text from the extracted pdf using Google Drive APIparser.py
: Basic cleans-up and parses land numbers from the text we obtainedgetJson.py
: Gets GeoJson files using 地號 GeoJSON API
Raw data for the illegal rooftop locations (District/Street Name)
Python scripts used in QGIS in order to analyse and generate our final.csv
file as well as snapshots at different zoom levels for the satellite and infrared imagery:
AddressToLatLon.py
: Transforms the raw illegal rooftop address into usable (lat,lon) coordinatesDrawOnMap.py
: Reads the illegal rooftops and renewal area data and displays them on the mapGenerateRenewalsQGIS.py
: Generates the first part of the information about renewal areas relating to illegal rooftopsGenerateRenewalPython.py
: Generates the second part of the information abotu renewal areas relating to land prices, vegetation and datesSnapIllegalRooftops.py
: Takes snapshots of all the illegal rooftops and stores them in a specified folderSnapRenewalAreas.py
: Takes snapshots of all the renewal areas and stores them in a specified folderUtils.py
: Commonly used functions and variables
This folder contains the website that demonstrates our result.
index.html
js/ui.js
css/columns.css
Icons thanks to
from Flaticon