Data Science Projects

Diverse Data Hub
Summary: Open educational resource offering curated data sets focused on equity, diversity, inclusion, and other socially relevant topics. It is designed to support students, educators, and researchers in accessing and working with meaningful data in their teaching, learning, and analysis.
Skills: Quarto, R, Regression, Statistics, R Packaging
UBC MDS Collaborators: Katie Burak (@katieburak), Azin Piran (@azinpiran), Siddarth Subrahmanian (@siddart1)
Commuting Insights
Summary: The dashboard visualizes commuting behavior across Canada, including transportation modes, average commute times, and regional dependencies on specific means of transport. The goal being to inform policies that can lead to better, more sustainable commuting solutions across the country.
Skills: Dash, Altair, Render.com, Python, Parquet
UBC MDS Collaborators: Eugene You (@jinxyou), Derek Rodgers (@derekrodgers), Han Wang (@hanwang205)
Spotify Spin Stats
Summary: This dashboard visualizes various aspects of music, including genres, characteristics (danceability, energy, valence, etc.), explicit content, and popularity levels of a wide variety of albums available on Spotify. This app provides insights to help music labels and emerging artists understand which musical characteristics make albums popular.
Skills: Shiny, R, ggplot2, Posit Connect Cloud
py_atmosphere Package
Summary: This package contains the simplified NASA’s GRC Earth Atmospheric model and calculates atmospheric air properties for an altitude of interest, and supports Mach number calculation for a moving object in space at the same altitude. In the context of aerospace engineering, a Standard Atmosphere model allows to calculate air properties to evaluate impact to aircraft operation.
Skills: poetry, cookiecutter, Python, pandas, pytest, Github Workflows, Continuous Development and Integration
UBC MDS Collaborators: Zhengling Jiang (@ClaireJ2100), Tianjiao Jiang (@tianjiao00).
Adult Income Predictor
Summary: Application of a K-Nearest Neighbors (KNN) Classifier to predict an individual’s annual income based on selected categorical demographic features using the Adult Dataset. The model achieved an accuracy of approximately 80%. This result emphasizes the importance of socioeconomic factors in determining income levels.
Skills: Make, Docker, Python, pandas, pytest, pandera, KNN Classifier
UBC MDS Collaborators: Michael Suriawan (@mikem2m), Quanhua Huang (@QuanhuaHuang-ubc), Tingting Chen (@Calista321).Pin by Denicon from Noun Project (CC BY 3.0)
Music by Lula Sugiantoro from Noun Project (CC BY 3.0)
Income by nuha from Noun Project (CC BY 3.0)
Airplane by Ales Studio from Noun Project (CC BY 3.0)