My name is Andriy Mulyar and I am an undergraduate passionate about mathematics and computer science. My research interests lie at the intersection of machine learning, statistical learning, and natural language processing. I particularly enjoy tackling interesting problems in text mining and information extraction.

The projects below capture the big-picture goals I am currently pursuing or have pursued.

Current Projects


Multi-label Document Classification with BERT
Language-model-powered architectures for long document classification (NeurIPS ML4Health 2019).
September 2019
Clinical Semantic Similarity
Training language models (BERT) to recognize semantic equivalence in clinical notes.
July 2019
medaCy: Medical Text Mining and NLP Framework
medaCy is a highly predictive text processing and NLP research framework built on spaCy that leverages cutting-edge tools for mining medical text.
August 2018

Inactive Projects

Clinical Concept Normalization and Extraction
Applying neural ranking to map unstructured text in clinical notes and electronic health records to structured medical ontologies.
June 2019
Automatic Graph Conjecturing
A service that automatically generates conjectures over graphs to empirically discover novel relations between graph-theoretic properties and invariants. A project under Dr. Craig Larson.
May 2019
Decision Trees: Exploiting Local Data Properties and Nested Ensembles
Trees are excellent learners: simple, interpretable, and versatile. This project explores their interaction with local data characteristics to improve predictive performance and interpretability.
March 2018
Gateway Math
Software for mathematics educators to generate dynamic worksheets.
January 2017
Reproducible Machine Learning
Effective methods to maintain replicable and reproducible research environments in computational science domains.
November 2018