Miscellaneous

News & Travel

July 2019: I initialized a new Python toolbox, combo, for the easy use of combination methods in machine learning.

May 27th, 2019: Our paper on anomaly detection tool, PyOD, is published in Journal of Machine Learning Research (JMLR).


Fun Facts

I am an active software developer with more than 4,200 GitHub stars in total (top 1,400 among 37,000,000 GitHub developers ranked by Gitstar Ranking). I led multiple popular open-source machine learning initiatives, including PyOD (total downloads > 120,000 times), combo, anomaly-detection-resources, and awesome-ensemble-learning. Before coming to CMU, I have more than 5-year industry experience as a software engineer and management consultant. See my professional experience for more information.

I am a dedicated technical writer with more than 200 articles (in Chinese) and 80,000 followers on Zhihu (知乎) — Chinese Quora (200 million+ registered users). Since 2018, I have been officially recognized as a “Top Zhihu Writer” (优秀回答者) in four fields (AI, ML, DM, and STAT). See my Zhihu page.


Profile Pictures

If needed, high-resolution profile pictures can be downloaded here:

Publications

I am open to peer review chances in the field of outlier & anomaly detection, ensemble Learning, clustering, and ML systems. Please send me an email (zhaoy@cmu.edu) or a request in the corresponding reviewing system.

Journal Reviewer


Working Papers

[w18a] DivBoost: Constructing Effective Outlier Ensembles by Base Learner Diversity Maximization

[w19a] HD-Cluster: Synthesized Cluster Analysis and Outlier Detection on High-dimensional Data

Under Review

[w19c] Colin Wan, Zheng Li, Alicia Guo, Yue Zhao. [A new statistical model. *Name masked due to double blind review policy]. AAAI Conference on Artificial Intelligence (AAAI), 2020. Submitted, under review.

[w19d] Yue Zhao, Xuejian Wang*, Cheng Cheng*, Xueying Ding* [Combining Machine Learning Models and Scores using combo library] AAAI Conference on Artificial Intelligence (AAAI), demo track, 2020. Submitted, under review. (*equal contribution).



Quickly discover relevant content by filtering publications.

Peer-reviewed Papers

(2019). Music Artist Classification with Convolutional Recurrent Neural Networks. International Joint Conference on Neural Networks (IJCNN).

PDF Code Arxiv

(2018). DCSO: Dynamic Combination of Detector Scores for Outlier Ensembles. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), Workshop on Outlier Detection De-constructed (ODD).

PDF Poster Slides

(2017). An empirical study of touch-based authentication methods on smartwatches. Proceedings of the 2017 ACM International Symposium on Wearable Computers (Equal contribution).

PDF DOI ACM DL

Software

I am an enthusiastic open-source developer: I build machine learning libraries and systems. Specifically, I initialized Python Outlier Detection library (PyOD) in 2018, which has become the most popular Python outlier detection toolkit. I also initialized combo: A Python Toolbox for Machine Learning Model Combination in July 2019–it is currently under active development. Watch/Star/Follow welcome!

A Python Toolbox for Machine Learning Model Combination.

PyOD–A Python Toolbox for Scalable Outlier Detection (Anomaly Detection).

Experience

Professional Positions

 
 
 
 
 

Senior Consultant

PwC Canada, Consulting & Deals

Feb 2017 – Jun 2019 Toronto, ON, Canada
I was a senior consultant with the following duties:

  • Designed fraud analytic solutions for major Canadian banks and insurance firms.
  • Led various applied data mining projects, e.g., client segmentation and churn analysis.
  • Developed multiple pricing optimization models with statistical methods.
 
 
 
 
 

Research Associate (Intern)

PwC Canada, Consulting & Deals

May 2016 – Dec 2016 Toronto, ON, Canada

Applied research in people analytics: build machine learning models for various people analytic projects.

Supervised by Prof. Anthony Bonner and the project is partly supported by Mitacs-Accelerate Research and Development Funding (IT07884).

 
 
 
 
 

Software Engineer (Intern & Contract)

Siemens PLM Software USA

Mar 2012 – Dec 2014 Cincinnati, Ohio, USA
As a co-op student and contractor, my works include:

  • Managed a Java project to transition the LabManager system to vCloud Director.
  • Refactored outdated automation code and added new modules and JUnit test cases.
  • Led a C++ Code Coverage project on Teamcenter platform to strengthen its stability.

Experience

Teaching Positions

 
 
 
 
 

Teaching Assistant

University of Toronto, Department of Computer Science

Sep 2015 – Dec 2015 Toronto, ON, Canada
I was a teaching assistant for Embedded Systems taught by Prof. Philip Anderson.
 
 
 
 
 

Teaching Assistant

University of Cincinnati, Department of Electrical Engineering & Computer Science

Sep 2014 – Dec 2014 Cincinnati, OH, USA
I was a teaching assistant for Introduction to Programming taught by Prof. George Purdy.

Funds and Awards

Mitacs-Accelerate Research and Development Funding

Project IT07884 ($30,000): machine learning in HR analytics.

Mantei/Mae Award & Scholar

Awarded to highest-performing students in Electrical Engineering, Computer Engineering, and Computer Science ($40,000 in four years).

University Global Award and Scholarship

Awarded to top performing international students ($32,000 in four years).

Contact

  • zhaoy@cmu.edu
  • Hamburgh Hall 2005, 4800 Forbes Ave, Pittsburgh, PA, USA, 15213
  • Wednesday 15:00 to 18:00
    Thursday 15:00 to 18:00
    Friday 15:00 to 18:00
    or by appointment