Dataset 1: Air Quality Data
Type of data: Environment
Data compiled by: Environmental Protection Department
Focus: 1) By District , 2) Specific air pollutants , 3) Static reads by hours
Insight: Provide health suggestions for Outdoors
Dataset:


Dataset 2: AQHI Records
Type of data: Environment
Data compiled by: Environmental Protection Department – Air Quality Health Index
Focus: 1) By District , 2) Record in hour , 3) In scale
Insight: Provide health suggestions for Outdoors
Data Cleaning: 1) Sub blanks with N/A , 2) Remove all irrelevant signs


Dataset 3: Hong Kong Weather Reports
Type of data: Climatic
Data compiled by: Hong Kong Observatory
Focus: 1) Accumulated , 2) Static value
Insight: Provide suggestions / Relative scales for people reference (daily use/property save)
Data Cleaning: Sub “microscale” with 0


Dataset 4: Hong Kong’s Average Maximum and Minimum Temperatures
Type of data: Climatic
Data compiled by: Hong Kong Observatory
Focus: Daily Maxium and Minimum
Insight: Energy and Resource Management
Data Cleaning: Sub blanks with 0


Dataset 5: Hong Kong Dollar Exchange Rate
Type of data: Finance
Data compiled by: –
Focus: 1) Daily closing figure , 2) 13 years of data
Insight: Make some time-period model for people who are planning for trip or own expenditure
Data Cleaning: Delete column no longer exist

Dataset 6: Hong Kong Wifi.hk Information
Type of data: Information Technology
Data compiled by: Office of the Government Chief Information Officer – Fixed wifi hk locations
Focus: 1) Hong Kong wifi locations
Insight: The Hong Kong Wifi.hk dataset offers insights into Wi-Fi accessibility, aiding policymakers, businesses, and tourists.
Data Cleaning: 1)Sub blanks with N/A


Dataset 7: KMB Bus Stop Information
Type of data: Transportation
Data compiled by: https://data.etabus.gov.hk
Focus: 1) Extract and understand real-time arrival information from KMB’s data dictionary.
Insight: The dataset contains bus stop information, including stop names in different languages, geographical coordinates, and routes.
Data Cleaning: 1)Sub blanks with N/A


Dataset 8: KMB Bus Arrival Time
Type of data: Transportation
Data compiled by: https://data.etabus.gov.hk
Focus: 1) Extract and understand real-time arrival information from KMB’s data dictionary.
Insight: The dataset contains bus stop information, including stop names in different languages, geographical coordinates, and routes.
Data Cleaning: 1)Sub blanks with N/A


Dataset 9: Market Data
Type of data: Life
Data compiled by: Common Spatial Data Infrastructure
Focus: Market data focuses on economic trends to influence job markets and living costs, aiming for growth and price stability.
Insight: can reveal job growth sectors, inflation rates, and consumer behavior, offering valuable guidance for personal financial decisions.
Data Cleaning: 1)Sub blanks with N/A


Dataset 10: Gym Facility Data
Type of data: Recreation and Culture
Data Compiled by: https://www.data.gov.hk/
Focus: –
Insight: –
Data Cleaning: 1)Duplicate data


Dataset 11: Population Census Statistics
Type of data: Council
Data Compiled by: Common Spatial Data Infrastructure
Focus: By District Council Constituency Area
Insight: Using district-level demographic data helps policymakers and businesses tailor services and planning to meet specific community needs, improving effectiveness in public services, healthcare, and the economy.
Data Cleaning: 1)Sub blanks with N/A


Dataset 12: Half-way and Handicapped Dormitory
Type of data: Social Welfare
Data Compiled by: https://www.data.gov.hk/
Focus: Hostel for Moderately Mentally Handicapped Persons
Insight: It provides home living for persons with moderate mental handicap who are capable of basic self-care but lack adequate daily living skills to live independently in the community.
Data Cleaning: 1)Sub blanks with N/A


Dataset 13: Charger Location Data for Public Use
Type of data: Transportation
Data Compiled by: https://www.data.gov.hk/
Focus: It shows the longitude and latitude, location, model, location sense latitude and longitude of the charger
Insight: Provide charger location
Data Cleaning: Missing values fill in with 0


Dataset 14: Distribution of Metered Parking Spaces and Utilization of New Metered Parking Spaces
Type of data: Transportation
Data Compiled by: Common Spatial Data Infrastructure
Focus: Location, the type of parking space that can be used, and the maximum parking time.
Insight: Help public to find out parking space
Data Cleaning: Missing values fill in with 0


Dataset 15: Inpatient Discharges and Deaths and Territory-wide Registered Deaths by Type of Disease
Type of data: Medical
Data Compiled by: https://www.data.gov.hk/
Focus: based on different diseases, shows the number of discharges and deaths in public and private hospitals, the number of registered deaths in Hong Kong for men , women and persons of unknown gender
Insight: learn about which diseases can treat better result in public or private hospital
Data Cleaning: –


Dataset 16: Mid-term Census
Type of data: Council
Data Compiled by: Common Spatial Data Infrastructure
Focus: By District Council Constituency Area
Insight: Using district-level demographic data helps policymakers and businesses tailor services and planning to meet specific community needs, improving effectiveness in public services, healthcare, and the economy.
Data Cleaning: 1)Sub blanks with N/A


Dataset 17: Information on distribution of anti-epidemic service packs in Sham Shui Po District in April 2022
Type of data: Social Welfare
Data Compiled by: Common Spatial Data Infrastructure
Focus: Show the number of household in different building
Insight: Learn about the number of household in different building in Sham Shui Po
Data Cleaning: 1)Sub blanks with Null


Dataset 18: Number of Road Traffic Accidents Number of Traffic Accidents at Junctions by Type of Junction, Junction Control Method and Severity
Type of data: Transportation
Data Compiled by: https://www.data.gov.hk/
Focus: Analyzing the impact of junction type, control method, and accident severity on the frequency of road traffic accidents.
Insight: Junction type, control methods significantly affect accident rates; roundabouts safest, signal-controlled intersections riskiest; severity varies.
Data Cleaning: 1)Sub blanks with N.A.


Dataset 19: Discharge water quality of water treatment plants (daily)
Type of data: Environment
Data Compiled by: https://www.data.gov.hk/
Focus: By Sewage Treatment Location (Ma Wan/Stonecutters island) Grab Effluent Sample E. coli (count/100mL)
Insight: provide public health protection advice
Data Cleaning: 1)Sub blanks with NA


Dataset 20: River Water Quality Data
Type of data: Environment
Data Compiled by: Environmental Protection Department
Focus: Show the chemical material that the river has
Insight: Provide water resource management advice
Data Cleaning: –