IGEM
IGEM
Apply as a Developer
AI Research Engineer

AI Research Engineer

Verified Expert in AI Research

  • AI Research Engineer
  • Boston, MA, USA
  • Member since January 10, 2019

Bio

Dr. Michael Chen specializes in developing cutting-edge AI models and conducting research in natural language processing and computer vision. With a PhD in Machine Learning and extensive publication record, he bridges the gap between academic research and production systems. His work focuses on making AI more efficient, interpretable, and accessible.

Portfolio

Large Language Model Fine-tuning

PyTorch, Transformers, PEFT, LoRA, Distributed Training, MLflow

Computer Vision Pipeline

TensorFlow, OpenCV, YOLO, Image Segmentation, Real-time Processing

NLP Research Platform

Python, BERT, GPT, Hugging Face, Named Entity Recognition, Sentiment Analysis

Experience

Machine Learning – 8 years
Python – 9 years
PyTorch – 6 years
TensorFlow – 7 years
NLP – 6 years
Computer Vision – 5 years
Deep Learning – 7 years
Research – 8 years

Availability

Contract/Consulting

Preferred Environment

Research Labs, Remote

The most amazing…

…achievement was publishing breakthrough NLP research at NeurIPS 2023 with 500+ citations.

Senior AI Research Engineer OpenAI Labs

2022 – 2024
  • Led research on efficient fine-tuning methods for large language models, reducing training costs by 70%.
  • Published 8 papers at top-tier conferences (NeurIPS, ICML, ACL) with 1000+ combined citations.
  • Developed novel architecture improvements achieving SOTA results on multiple benchmarks.
  • Collaborated with product teams to deploy research models serving 10M+ users.

Technologies: PyTorch, Transformers, CUDA, Distributed Training, MLflow, Weights & Biases, Python, C++

Machine Learning Research Scientist AI Research Institute

2019 – 2022
  • Conducted foundational research in natural language understanding and multimodal learning.
  • Built efficient training pipelines reducing experiment iteration time by 50%.
  • Mentored PhD students and contributed to grant proposals securing $2M in funding.
  • Open-sourced research code and models adopted by 1000+ researchers globally.

Technologies: TensorFlow, PyTorch, JAX, Python, NumPy, Pandas, Scikit-learn

PhD Research Assistant MIT CSAIL

2015 – 2019
  • Conducted doctoral research in deep learning for natural language processing.
  • Published 6 peer-reviewed papers at major AI conferences.
  • Developed novel attention mechanisms later adopted in production systems.

Technologies: Python, TensorFlow, Theano, NLTK, SpaCy, Research Methodology

2015 – 2019

PhD in Computer Science (Machine Learning)

Massachusetts Institute of Technology – Cambridge, MA

Machine Learning

Deep Learning, Neural Networks, Transformers, CNNs, RNNs, GANs, Reinforcement Learning, Transfer Learning

Languages

Python, C++, Julia, R, SQL, CUDA

Frameworks

PyTorch, TensorFlow, JAX, Hugging Face Transformers, scikit-learn, XGBoost, LightGBM

NLP

BERT, GPT, T5, LLaMA, Tokenization, Named Entity Recognition, Sentiment Analysis, Machine Translation

Computer Vision

OpenCV, YOLO, ResNet, Vision Transformers, Image Segmentation, Object Detection, GANs

Hiring Made Easy

1
2
3
Hired
1

Talk to Our Recruitment Expert

An expert on our team will work with you to understand your goals, technical needs, and team dynamics.

2

Get Shortlisted & Already Vetted Candidates

We will share the top talent that matches your exact requirements. Already tested with complete interview details.

3

Interview & Finalise the Ones You Like

You can test them again by interviewing as you like before finalising. You get complete support from us in the interview process.

Don’t miss out on industry’s top talent

Hire Now
IGEM
IGEM is a platform that helps you hire top 3% AI & JS developers in less than 24 hours.

Tools

Jupyter, MLflow, Weights & Biases, TensorBoard, Docker, Git, Linux, AWS SageMaker

Data Processing

NumPy, Pandas, Spark, Dask, Data Augmentation, Feature Engineering

Other

Research Methodology, Paper Writing, Statistical Analysis, A/B Testing, Model Optimization, Distributed Training, MLOps