Markdown Monster icon

About

I am Jinpu Cao, a graduate student in the Sustainable Design and Construction Program of the Civil and Environmental Engineering Department at Stanford University (2020 - 2022). My research interests lie in applying data-driven methods and Artificial Intelligence (AI) to Architecture Engineering and Construction (AEC) to help us create smart and sustainable infrastructures and living environments.

Education Experience

Conferences

  1. Speaker in the 4th International Conference on Information Technology in Geo-Engineering (4ICITG)

    The conference was organized by Geotechnical Society of Singapore in August 2022.

    Presentation: “A Long-term Probabilistic Forecasting Approach of TBM Operating Parameters based on Deep Learning” (video, slides, code)

  2. Speaker in the II International Geo-science Machine Learning Big Data Seminar.

    Hosted by ISSMGE Technical Committee of Machine Learning and Big Data (TC309) Risk and Insurance Research Branch of China Civil Engineering Society in July 2019, Shanghai.

    Presentation: “Predicting TBM Performance using Machine Learning: is Surrounding Rock Information Important.

Publications

  1. An LSTM-based model for TBM performance prediction and the effect of rock mass grade on prediction accuracy (China Civil Engineering Journal, first author, accepted)

  2. PigSense: Vibration-based Activity and Health Monitoring System for Pigs (ACM Journals, co-author, in review)

  3. Improving adaptation to wildfire smoke and extreme heat in vulnerable communities: Evidence from a pilot study in the San Francisco Bay Area (Environmental Research Letters, contributor, ready to submit)

Research Projects

  1. Computer Vision-based Pavement Distress Detection System (March 2022 - Now, Stanford University)

    Key Words: Computer Vision, Crack Detection, Web Development, Business Value, Deep Learning     (Python, TensorFlow, Dash)

    Worked as a research assistant advised by Pooja Jain (V.P. Strategic Innovation in WSP) and Dr. Martin Fischer (Professor at Stanford).

    This is a significant exploration of developing the practical workflow of an end-to-end vision-based automatic pavement distress detection system (demonstration video). Also, I am responsible for figuring out its potential business value.

  2. Pilot Air Quality Analysis in the Bay Area (March 2022 - Now, Stanford University)

    Key Words: Data Manipulation, Time Series Decomposition, Clustering, Regressive Analysis, Community-engaged    (R)

    Worked as a research assistant advised by Derek Ouyang (Research Manager at the RegLab, Stanford) and Dr. Gabrielle Wong-Parodi (Professor at Stanford).

    This community-engaged pilot study aims to improve adaptation to wildfire smoke and extreme heat in vulnerable communities. I am responsible for characterizing these communities’ air quality and exploring their human-induced and environment-induced influence. Our team characterized the relationship between indoor and outdoor air quality with a “spike lag” model and verified it with actual monitoring data. (Pilot Report)

  3. PigSense: Structural Vibration-based Activity and Health Monitoring System for Pigs (Jan 2022 - Sept 2022, Stanford University)

    Key Words: Signal Processing, Classification, Machine Learning     (MATLAB, Python, scikit-learn)

    Worked as a research volunteer advised by Dr. Hae Young Noh (Professor at Stanford) in Stanford Structures as Sensors Lab

    The project introduced the first system-PigSense to use structural vibration to track animals. The system uses physical knowledge of the structural vibration characteristics caused by pig-activity-induced load changes to recognize different behaviors of the sow and piglets. I am responsible for applying machine learning to automate the characterization of the piglet group activities, including nursing, sleeping, and being active base on vibration data.

  4. Sustainable Urban System Projects (Sept 2021 - June 2022, Stanford University)

    Key Words: Data Manipulation and Visualization, Geospatial Data, Census Data, Equity analysis, Monte Carlo simulations, Regression, Causality Analysis, Web Application     (R, Shiny)

    The Sustainable Urban Systems (Stanford CEE 218, Shaping the Future of the Bay Area, advised by Derek Ouyang and Dr. Jenny Suckale) emphasis merges traditional data analytics with complex systems analysis to better inform decisions around the wicked problems of urban development like urban land use, hazard analysis, amenity accessibility, equity implications of air quality and emission analysis.

  5. Fortuna Cools Life Cycle Assessment (Sept 2020 - Jan 2021, Stanford University)

    Key Words: Life Cycle Assessment, Sensitivity Analysis     (R, SimaPro)

    Fortuna Cools (Fortuna) was founded in 2018 with the goal of creating an affordable and environmentally sustainable option for coolers for fishermen. To date, they have operated in the Philippines, producing coolers made in large part of coconut husk, a common waste product. The coolers are intended as an alternative for expanded polystyrene (EPS) coolers with a usable life of just two weeks, or as alternatives to reusable polyurethane coolers that are often too expensive to be a feasible option for fishermen. Stanford team and Fortuna team worked together and conducted a formal life cycle assessment (Stanford CEE 226: Life Cycle Assessment for Complex Systems, advised by Dr. Michael Lepech) of Fortuna coolers. We determined that the Fortuna cooler life cycle is associated with drastically lower greenhouse gas emissions and is comparable to or has slightly higher emissions than EPS coolers in several impact categories. The capability of coconut husks to be degraded and converted into a biodegradable compost and fertilizer minimizes the energy requirement towards the end-of-life disposal; the ability to compost the Fortuna cooler provides additional benefit over EPS. (Poster, Report)

  6. Battery Lifetime Prediction with Limited Cycle Data (Sept 2021 - Jan 2022, Stanford University)

    Key Words: Battery Lifetime Prediction, CNN, Bi-LSTM, Confidence Interval     (Python, TensorFlow, PyTorch).

    Accurately predicting the remaining useful lifetime of batteries is critical for accelerating technological development and creating a paradigm shift in battery usage. Data-driven approaches,based on large datasets, provide a physical-model agnostic way to predict the health status of batteries with high accuracy. However, most datadriven methods on battery life prediction often rely on features extracted from a hundred cycles worth of data for a given cell, making it computationally inefficient and incompatible with on-board application.

    The course project (Stanford CS 329P Practical Machine Learning, advised by Dr. Mu Li) applied machine-learning models, including linear regression, random forest regression, convolutional neural networks, and recurrent neural networks to make predictions on cell life. Our best model achieve a 7.5% prediction error given the data of only 5 cycles. (Report, Slides, Video)

  7. Long term probability prediction platform of urban water consumption (Oct 2020 - Aug 2021, Tongji University)

    Key Words: Time Series, Probabilistic Prediction, DeepAR     (Python, GluonTS)

    Worked as a research assistant advised by Dr. Fang Liu (Professor at Tongji University)

    This was a practical and valuable project to build an intelligent diagnosis, risk reasoning, and decision support system for municipal facilities. I independently developed a water consumption probabilistic prediction model based on the deep autoregressive (DeepAR) algorithm. The model has been integrated into the Shanghai Smart Intelligent Platform to predict daily water consumption and provide plumbing burst alarming services. (Report, Slides, Codes)

  8. Tunnel Boring Machine (TBM) Operation Parameters Prediction (Nov 2018 - Aug 2019, Tongji University)

    Key Words: TBM, Parameter Prediction, LSTM, Data Augmentation     (Python, Keras)

    Worked as a research assistant advised by Dr. Fang Liu (Professor at Tongji University)

    The project aims to predict TBM operation parameters based on its historical data, promoting safe and efficient tunneling construction of TBM. I independently developed TBM parameters and geological prediction model based on Long Short-Term Memory (LSTM) network , and won the National Third Place in the data mining competition organized by the Chinese Society of Rock Mechanics and Engineering.

  9. Machine Learning Contest: Infrared Spectrum Classification (July 2022, Stanford University)

    Key Words: Infrared Spectrum Classification, Machine Learning     (Python, AutoGluon).

    The Near Infrared Spectroscopy Branch of the China Instrument Society holds the data modeling contest. The organizer provides a set of near-infrared spectral data from the actual application scenario. Different data preprocessing technologies (e.g. PCA, normalization) and machine learning models (AutoGluon) were tried for the unfrared spectrum classification problem.

Honors

Stanford University

    2020-2022 Outstanding Project of SFBI

Tongji University

    2017-2020 Outstanding Graduates in Shanghai (Top 5% of 220,000 students)

                The First Prize Merit-Scholarship of Tongji University (Top 5% of 400 Students)

                Shanghai Scholarship (Top 5% of 220,000 students)

                Excellent Students of Tongji University (Top 3% of 4000 students)

                The Second Prize of Structure Design and Model Competition

    National College Students’ Mathematics Competition

                2016-2017 The Second Prize of Shanghai Division

    National College Students’ Mathematical Modeling Competition

                2016-2017 The Third Prize in the Shanghai Division

Internship

Shanghai Shentong Metro Group (Shanghai, June 2019-Aug 2019)

Subway Monitoring Intern

This was an interesting practical expedition to assist in monitoring and analyzing the subway deformation and settlement near an ultra-deep foundation pit (Xuhui Center, Shanghai). I processed and visualized the subway deformation monitoring data based on python and proposed corresponding corrective measures to the construction unit.

CV & Bio

Download my CV here. Last updated Nov 2022.

Email:

GitHub: https://github.com/J-i-n-p-u

Twitter: @Jinpu_C

Linkedin: https://www.linkedin.com/in/jinpu-cao-a003bb1b6