About

Hello, I'm Saeed.

I'm a senior machine learning engineer at Roper Technologies, working on agentic AI. I like working end to end, from research and modeling through to shipping things that hold up in production.

My background spans deep learning, computer vision, and generative models. PhD from York University; previously a research scientist at Wētā FX, Amazon Games, and Ubisoft La Forge.

Saeed Ghorbani

Senior Machine Learning Engineer

Roper Technologies

Toronto, Canada

Contact →

Timeline

News & milestones

Research

Selected publications

2025

arXiv

Aether Weaver: Multimodal Affective Narrative Co-Generation with Dynamic Scene Graphs

Saeed Ghorbani

arXiv 2025

PDF

2024

ECCV

Real-Time Neural Cloth Deformation using a Compact Latent Space and a Latent Vector Predictor

Chanhaeng Lee, Mykhailo Perepichka, Saeed Ghorbani, Sudhir Mudur, Eric Paquette, Tiberiu Popa

European Conference on Computer Vision (ECCV) 2024

PDF
ECCVW

SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers

Vandad Davoodnia, Saeed Ghorbani, Alexandre Messier, Ali Etemad

ECCV Workshop on CV for Metaverse 2024

PDF WEBSITE
ECCV

UPose3D: Uncertainty-Aware 3D Human Pose Estimation with Cross-View and Temporal Cues

Vandad Davoodnia, Saeed Ghorbani, Marc-André Carbonneau, Alexandre Messier, Ali Etemad

European Conference on Computer Vision (ECCV) 2024

PDF WEBSITE

2023

CGF

ZeroEGGS: Zero-shot Example-based Gesture Generation from Speech

Saeed Ghorbani, Ylva Ferstl, Daniel Holden, Nikolaus F. Troje, Marc-André Carbonneau

Computer Graphics Forum 2023

PDF CODE WEBSITE

2022

ICMI

Exemplar-based Stylized Gesture Generation from Speech: An Entry to the GENEA Challenge 2022

Saeed Ghorbani, Ylva Ferstl, Marc-André Carbonneau

International Conference on Multimodal Interaction (ICMI) 2022

PDF
APIN

Estimating Pose from Pressure Data for Smart Beds with Deep Image-based Pose Estimators

Vandad Davoodnia, Saeed Ghorbani, Ali Etemad

Applied Intelligence (Springer) 2022

2021

ICPR

Gait Recognition using Multi-Scale Partial Representation Transformation with Capsules

Alireza Sepas-Moghaddam, Saeed Ghorbani, Nikolaus F. Troje, Ali Etemad

International Conference on Pattern Recognition (ICPR) 2021

PDF
ICASSP

In-bed Pressure-based Pose Estimation using Image Space Representation Learning

Vandad Davoodnia, Saeed Ghorbani, Ali Etemad

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021

PDF
PLOS ONE

MoVi: A Large Multi-Purpose Human Motion and Video Dataset

Saeed Ghorbani, Kimia Mahdaviani, Anne Thaler, Konrad Kording, Douglas James Cook, Gunnar Blohm, Nikolaus F. Troje

PLOS ONE 2021

PDF CODE WEBSITE

2020

CGF

Probabilistic Character Motion Synthesis using a Hierarchical Deep Latent Variable Model

Saeed Ghorbani, Calden Wloka, Ali Etemad, Marcus A. Brubaker, Nikolaus F. Troje

Computer Graphics Forum (Symposium on Computer Animation) 2020

PDF WEBSITE

2019

CGI

Auto-labelling of Markers in Optical Motion Capture by Permutation Learning

Best Paper Award

Saeed Ghorbani, Ali Etemad, Nikolaus F. Troje

Computer Graphics International (CGI) 2019

PDF
CVR

Automatic Initialization and Tracking of Markers in Optical Motion Capture by Learning to Rank

Best Poster Award

Saeed Ghorbani, Ali Etemad, Nikolaus F. Troje

CVR Vision Conference 2019

PDF

2010

WCSP

Sub-pixel Image Registration based on Physical Forces

Ali Ghayoor, Saeed Ghorbani, Ali Asghar Beheshti Shirazi

International Conference on Wireless Communications & Signal Processing (WCSP) 2010

PDF

Work

Projects

Probabilistic Motion Model

Probabilistic Motion Model

Probabilistic character motion synthesis using a hierarchical deep latent variable model. A framework that generates realistic and diverse character animations from weak control signals while preserving the stochastic nature of human movement.

  • motion synthesis
  • character animation
  • variational autoencoder
  • deep learning
View →
MoVi Dataset

MoVi Dataset

A large multi-purpose human motion and video dataset with synchronized pose, body meshes, and video recordings. It contains 90 actors performing 20+ everyday and sports movements, captured with optical motion capture, multi-view video, and IMU sensors.

  • motion capture
  • dataset
  • pose estimation
  • computer vision
View →
ZeroEGGS

ZeroEGGS

Zero-shot example-based gesture generation from speech with style control. A neural network framework that generates full-body co-speech gestures, including finger-level detail, with style controlled by a short example motion clip even for styles unseen during training.

  • gesture generation
  • speech
  • character animation
  • deep learning
View →