Leonid Shamis

Email:
Phone: (916) 402-0199
GitHub: lshamis   alephzero

Staff Software Engineer with expertise in machine learning, robotics, NLP, and distributed systems, with a proven track record of developing and deploying advanced AI systems.

Skills

Programming Languages: C/C++, Python, JavaScript, Go, Rust, Java, C#
Machine Learning & AI: PyTorch, TensorFlow, SIMD, NLP, Robotics
Systems & Infrastructure: Linux, Containers (Docker/Apptainer/Enroot), SLURM, Ray, Distributed Systems
Development Tools: Git, Bash, Cross-Language Integration (JNI, PyBind, CGo), Performance Optimization

Work History

Meta
August 2021 - Present

FAIR - Staff Research Engineer June 2023 - Present

Led Chameleon (multi-modal foundation model) inference optimization, demo infrastructure, and open-source release with HuggingFace.

Developed ALMA, a low-annotation LLM alignment method enhancing controllability.

Primary maintainer of FAIR's new LLM training codebase to support future model development. Hopefully to be open sourced soon.

FAIR - Research Engineering Manager May 2022 - June 2023

Managed research teams for mobility, manipulation, and home robotics.

FAIR - Staff Research Engineer August 2021 - May 2022

Led development of the Meta Robotics Platform (MRP) for scalable robotics research.

Third Wave Automation
Feb 2019 - July 2021

Director of Engineering August 2020 - July 2021

Led teams developing autonomous vehicle motion planning, SLAM, and orchestration.

Principal Engineer February 2019 - August 2020

Designed scalable motion planning and perception systems for autonomous forklifts.

Artificial
Sept 2017 - Feb 2019

Robot Intelligence Infrastructure

Founding engineer. Built unified robotic data transfer and orchestration system.

drive.ai
Nov 2016 - Aug 2017

Motion Planning

Led motion planning team, designing simulation tools and semantic map representations. Developed algorithms for route planning and vehicle dynamics.

Google
Feb 2011 - Nov 2016

Google Brain October 2015 - November 2016

Researched reinforcement learning and deep learning methods for large-scale machine learning applications.

Google[X] April 2015 - October 2015

Developed control software for multi-agent robotic systems and designed safety mechanisms for experimental hardware.

Language Understanding February 2014 - April 2015

Built NLP models for sentiment analysis, QA, and personalized search. Designed systems to extract testimonials and opinions from Play Store comments.

Search Ranking Engine Analytics February 2011 - February 2014

Built large-scale data analysis pipelines and A/B testing infrastructure for Google's search ranking. Developed automated deployment, monitoring, and traffic prediction systems.

Education

University of California, Davis

MS in Computer Science

Fall 2009 - Winter 2010

BS with Honors, Computer Science

Summer 2007 - Spring 2009

Publications

Constraint Based Framework for Optimal k-Clustering.
Master's Thesis. University of California, Davis, 2010.

A SAT-based Framework for Efficient Constrained Clustering.
Davidson, Ian, Ravi, S. S. and Shamis, Leonid. SDM, 2010.

Chameleon: Mixed-Modal Early-Fusion Foundation Models.
Chameleon Team. arXiv:2405.09818, May 2024.

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model.
Chunting Zhou, Lili Yu, Arun Babu, Kushal Tirumala, Michihiro Yasunaga, Leonid Shamis, Jacob Kahn, Xuezhe Ma, Luke Zettlemoyer, Omer Levy. arXiv:2408.11039, August 2024.

ALMA: Alignment with Minimal Annotation.
Michihiro Yasunaga, Leonid Shamis, Chunting Zhou, Andrew Cohen, Jason Weston, Luke Zettlemoyer, Marjan Ghazvininejad. arXiv:2412.04305, December 2024.