Saeed Ghorbani — Machine Learning Engineer & Scientist

About

Hello, I'm Saeed.

I'm a senior machine learning engineer at Roper Technologies, working on agentic AI. I like working end to end, from research and modeling through to shipping things that hold up in production.

My background spans deep learning, computer vision, and generative models. PhD from York University; previously a research scientist at Wētā FX, Amazon Games, and Ubisoft La Forge.

Senior Machine Learning Engineer

Roper Technologies

Toronto, Canada

Contact →

Timeline

News & milestones

May 1, 2026 I joined Roper Technologies as a Senior Machine Learning Engineer, working on agentic AI.
Feb 1, 2026 I joined Wētā FX as a Senior Research Scientist.
Jan 15, 2025 I joined Amazon Games as a Research Scientist.
Jul 19, 2024 Our paper on human pose estimation with cross-view and temporal cues has been accepted to ECCV 2024.
May 15, 2023 ZeroEGGS is featured on the Ubisoft La Forge blog.
Mar 13, 2023 Zero-shot Example-based Gesture Generation from Speech is published at Computer Graphics Forum. Link to the paper
Nov 28, 2022 I will give a talk at SIGGRAPH ASIA on Thursday, 8 December 2022, 5:00pm - 6:00pm KST for the Games Feature Sessions. Details.
Nov 28, 2022 ZEGGS is featured in the Two Minute Papers channel.
Oct 12, 2022 We published our work on Zero-shot Example-based Gesture Generation from Speech including code, dataset, and preprint.
Apr 25, 2022 I defended my PhD thesis.
Apr 12, 2022 I joined Ubisoft La Forge as an R&D Scientist.
Apr 8, 2021 Our paper "Estimating Pose from Pressure Data for Smart Beds with Deep Image-based Pose Estimators" was accepted at the Journal of Applied Intelligence, Springer.
Jan 27, 2021 Our paper "In-bed Pressure-based Pose Estimation using Image Space Representation Learning" was accepted at ICASSP2021.
Oct 10, 2020 Our paper "Gait Recognition using Multi-Scale Partial Representation Transformation with Capsules" was accepted at ICPR2020.
Oct 1, 2020 Our new paper on Motion Modelling is now published at Computer Graphics Forum. Watch the short and long presentations.
Mar 6, 2020 MoVi: We just published a BIG new dataset of human motion data. 9h mocap + 17h calibrated video + 7h of IMU + MoSh reconstructed body shape. Check the website.
Jul 16, 2019 Our work on Auto-labelling of markers in optical motion capture is highlighted on YorkU VISTA news.
Jun 20, 2019 Our work on Auto-labelling of markers in optical motion capture won the best full paper award at the International Conference on Computer Graphics.
May 8, 2019 Our work on Auto-labelling of markers in optical motion capture won 2nd Place Computer Vision Poster Award at the CVR-VISTA International Conference on Predictive Vision.

Research

Selected publications

2025

arXiv

Aether Weaver: Multimodal Affective Narrative Co-Generation with Dynamic Scene Graphs

Saeed Ghorbani

arXiv 2025

PDF

@article{aether-weaver-2025,
  title   = {Aether Weaver: Multimodal Affective Narrative Co-Generation with Dynamic Scene Graphs},
  author  = {Saeed Ghorbani},
  journal = {arXiv},
  year    = {2025}
}

2024

ECCV

Real-Time Neural Cloth Deformation using a Compact Latent Space and a Latent Vector Predictor

Chanhaeng Lee, Mykhailo Perepichka, Saeed Ghorbani, Sudhir Mudur, Eric Paquette, Tiberiu Popa

European Conference on Computer Vision (ECCV) 2024

PDF

@article{neural-cloth-deformation-2024,
  title   = {Real-Time Neural Cloth Deformation using a Compact Latent Space and a Latent Vector Predictor},
  author  = {Chanhaeng Lee, Mykhailo Perepichka, Saeed Ghorbani, Sudhir Mudur, Eric Paquette, Tiberiu Popa},
  journal = {European Conference on Computer Vision (ECCV)},
  year    = {2024}
}

ECCVW

SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers

Vandad Davoodnia, Saeed Ghorbani, Alexandre Messier, Ali Etemad

ECCV Workshop on CV for Metaverse 2024

PDF WEBSITE

@article{skelformer-2024,
  title   = {SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers},
  author  = {Vandad Davoodnia, Saeed Ghorbani, Alexandre Messier, Ali Etemad},
  journal = {ECCV Workshop on CV for Metaverse},
  year    = {2024}
}

ECCV

UPose3D: Uncertainty-Aware 3D Human Pose Estimation with Cross-View and Temporal Cues

Vandad Davoodnia, Saeed Ghorbani, Marc-André Carbonneau, Alexandre Messier, Ali Etemad

European Conference on Computer Vision (ECCV) 2024

PDF WEBSITE

@article{upose3d-2024,
  title   = {UPose3D: Uncertainty-Aware 3D Human Pose Estimation with Cross-View and Temporal Cues},
  author  = {Vandad Davoodnia, Saeed Ghorbani, Marc-André Carbonneau, Alexandre Messier, Ali Etemad},
  journal = {European Conference on Computer Vision (ECCV)},
  year    = {2024}
}

2023

CGF

ZeroEGGS: Zero-shot Example-based Gesture Generation from Speech

Saeed Ghorbani, Ylva Ferstl, Daniel Holden, Nikolaus F. Troje, Marc-André Carbonneau

Computer Graphics Forum 2023

PDF CODE WEBSITE

@article{zeroeggs-2023,
  title   = {ZeroEGGS: Zero-shot Example-based Gesture Generation from Speech},
  author  = {Saeed Ghorbani, Ylva Ferstl, Daniel Holden, Nikolaus F. Troje, Marc-André Carbonneau},
  journal = {Computer Graphics Forum},
  year    = {2023}
}

2022

ICMI

Exemplar-based Stylized Gesture Generation from Speech: An Entry to the GENEA Challenge 2022

Saeed Ghorbani, Ylva Ferstl, Marc-André Carbonneau

International Conference on Multimodal Interaction (ICMI) 2022

PDF

@article{exemplar-gesture-genea-2022,
  title   = {Exemplar-based Stylized Gesture Generation from Speech: An Entry to the GENEA Challenge 2022},
  author  = {Saeed Ghorbani, Ylva Ferstl, Marc-André Carbonneau},
  journal = {International Conference on Multimodal Interaction (ICMI)},
  year    = {2022}
}

APIN

Estimating Pose from Pressure Data for Smart Beds with Deep Image-based Pose Estimators

Vandad Davoodnia, Saeed Ghorbani, Ali Etemad

Applied Intelligence (Springer) 2022

@article{smart-beds-pose-2022,
  title   = {Estimating Pose from Pressure Data for Smart Beds with Deep Image-based Pose Estimators},
  author  = {Vandad Davoodnia, Saeed Ghorbani, Ali Etemad},
  journal = {Applied Intelligence (Springer)},
  year    = {2022}
}

2021

ICPR

Gait Recognition using Multi-Scale Partial Representation Transformation with Capsules

Alireza Sepas-Moghaddam, Saeed Ghorbani, Nikolaus F. Troje, Ali Etemad

International Conference on Pattern Recognition (ICPR) 2021

PDF

@article{gait-recognition-icpr-2021,
  title   = {Gait Recognition using Multi-Scale Partial Representation Transformation with Capsules},
  author  = {Alireza Sepas-Moghaddam, Saeed Ghorbani, Nikolaus F. Troje, Ali Etemad},
  journal = {International Conference on Pattern Recognition (ICPR)},
  year    = {2021}
}

ICASSP

In-bed Pressure-based Pose Estimation using Image Space Representation Learning

Vandad Davoodnia, Saeed Ghorbani, Ali Etemad

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021

PDF

@article{in-bed-pose-icassp-2021,
  title   = {In-bed Pressure-based Pose Estimation using Image Space Representation Learning},
  author  = {Vandad Davoodnia, Saeed Ghorbani, Ali Etemad},
  journal = {IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  year    = {2021}
}

PLOS ONE

MoVi: A Large Multi-Purpose Human Motion and Video Dataset

Saeed Ghorbani, Kimia Mahdaviani, Anne Thaler, Konrad Kording, Douglas James Cook, Gunnar Blohm, Nikolaus F. Troje

PLOS ONE 2021

PDF CODE WEBSITE

@article{movi-2021,
  title   = {MoVi: A Large Multi-Purpose Human Motion and Video Dataset},
  author  = {Saeed Ghorbani, Kimia Mahdaviani, Anne Thaler, Konrad Kording, Douglas James Cook, Gunnar Blohm, Nikolaus F. Troje},
  journal = {PLOS ONE},
  year    = {2021}
}

2020

CGF

Probabilistic Character Motion Synthesis using a Hierarchical Deep Latent Variable Model

Saeed Ghorbani, Calden Wloka, Ali Etemad, Marcus A. Brubaker, Nikolaus F. Troje

Computer Graphics Forum (Symposium on Computer Animation) 2020

PDF WEBSITE

@article{probabilistic-character-motion-2020,
  title   = {Probabilistic Character Motion Synthesis using a Hierarchical Deep Latent Variable Model},
  author  = {Saeed Ghorbani, Calden Wloka, Ali Etemad, Marcus A. Brubaker, Nikolaus F. Troje},
  journal = {Computer Graphics Forum (Symposium on Computer Animation)},
  year    = {2020}
}

2019

CGI

Auto-labelling of Markers in Optical Motion Capture by Permutation Learning

Best Paper Award

Saeed Ghorbani, Ali Etemad, Nikolaus F. Troje

Computer Graphics International (CGI) 2019

PDF

@article{auto-labelling-markers-cgi-2019,
  title   = {Auto-labelling of Markers in Optical Motion Capture by Permutation Learning},
  author  = {Saeed Ghorbani, Ali Etemad, Nikolaus F. Troje},
  journal = {Computer Graphics International (CGI)},
  year    = {2019}
}

CVR

Automatic Initialization and Tracking of Markers in Optical Motion Capture by Learning to Rank

Best Poster Award

Saeed Ghorbani, Ali Etemad, Nikolaus F. Troje

CVR Vision Conference 2019

PDF

@article{marker-tracking-learning-to-rank-cvr-2019,
  title   = {Automatic Initialization and Tracking of Markers in Optical Motion Capture by Learning to Rank},
  author  = {Saeed Ghorbani, Ali Etemad, Nikolaus F. Troje},
  journal = {CVR Vision Conference},
  year    = {2019}
}

2010

WCSP

Sub-pixel Image Registration based on Physical Forces

Ali Ghayoor, Saeed Ghorbani, Ali Asghar Beheshti Shirazi

International Conference on Wireless Communications & Signal Processing (WCSP) 2010

PDF

@article{subpixel-image-registration-wcsp-2010,
  title   = {Sub-pixel Image Registration based on Physical Forces},
  author  = {Ali Ghayoor, Saeed Ghorbani, Ali Asghar Beheshti Shirazi},
  journal = {International Conference on Wireless Communications & Signal Processing (WCSP)},
  year    = {2010}
}

Work

Projects

Probabilistic Motion Model

Probabilistic character motion synthesis using a hierarchical deep latent variable model. A framework that generates realistic and diverse character animations from weak control signals while preserving the stochastic nature of human movement.

motion synthesis
character animation
variational autoencoder
deep learning

View →

MoVi Dataset

A large multi-purpose human motion and video dataset with synchronized pose, body meshes, and video recordings. It contains 90 actors performing 20+ everyday and sports movements, captured with optical motion capture, multi-view video, and IMU sensors.

motion capture
dataset
pose estimation
computer vision

View →

ZeroEGGS

Zero-shot example-based gesture generation from speech with style control. A neural network framework that generates full-body co-speech gestures, including finger-level detail, with style controlled by a short example motion clip even for styles unseen during training.

gesture generation
speech
character animation
deep learning

View →