Aljosa's Web Corner

Hi, I'm Aljosa! I am a Senior Research Scientist at NVIDIA, working on learning to understand the dynamic world from raw, unlabeled streams of sensory data.

I come from the Alpine side of Slovenia. Previously, I obtained my M.Sc. degree from the University of Bonn and my Ph.D. from RWTH Aachen University under the supervision of Prof. Bastian Leibe. I have had the pleasure of working with Prof. Laura Leal-Taixé (Dynamic Vision and Learning Group) at the Technical University of Munich and Prof. Deva Ramanan at the Robotics Institute of Carnegie Mellon University.

A. Ošep, T. Meinhardt, F. Ferroni, N. Peri, D. Ramanan, L. Leal-Taixé: Better Call SAL: Towards Learning to Segment Anything in Lidar, arXiv preprint arXiv:2403.13129, 2024.
paper

J. Seidenschwarz, A. Ošep, F. Ferroni, S. Lucey, L. Leal-Taixé: What Moves Together Belongs Together, Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
paper

C. Saltori, A. Ošep, E. Ricci, L. Leal-Taixé: Walking Your LiDOG: A Journey Through Multiple Domains for LiDAR Semantic Segmentation, International Conference on Computer Vision (ICCV), 2023.
video code paper

A. Agarwalla, X. Huang, J. Ziglar, F. Ferroni, L. Leal-Taixé, J. Hays, A. Ošep, D. Ramanan: Lidar Panoptic Segmentation and Tracking without Bells and Whistles, International Conference on Intelligent Robots and Systems (IROS), 2023.
page paper

X. Wu, K. Lau, F. Ferroni, A. Ošep, D. Ramanan: Pix2Map: Cross-modal Retrieval for Inferring Street Maps from Images, Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
poster video page paper

V. Fomenko, I. Elezi, D. Ramanan, L. Leal-Taixé, A. Ošep: Learning to Discover and Detect Objects, Neural Information Processing Systems (NeurIPS), 2022.
page poster video code paper

P. Dendorfer, V. Yugay, A. Ošep, L. Leal-Taixé: Quo Vadis: Is Trajectory Forecasting the Key Towards Long-Term Multi-Object Tracking?, Neural Information Processing Systems (NeurIPS), 2022.
video code paper

A. Kim, G. Brasó, A. Ošep, L. Leal-Taixé: PolarMOT: How far can geometric relations take us in 3D multi-object tracking?, European Conference on Computer Vision (ECCV), 2022.
video poster code paper

Q. Zhou, S. Agostinho, A. Ošep, L. Leal-Taixé: Is Geometry Enough for Matching in Visual Localization?, European Conference on Computer Vision (ECCV), 2022.
paper code

L. Nunes, X. Chen, R. Marcuzzi, A. Ošep, L. Leal-Taixé, C. Stachniss, J. Behley: Unsupervised Class-Agnostic Instance Segmentation of 3D LiDAR Data for Autonomous Vehicles, IEEE Robotics and Automation Letters (RA-L), 2022.
code paper

M. Gladkova, N. Korobov, N. Demmel, A. Ošep, L. Leal-Taixé, D. Cremers: DirectTracker: 3D Multi-Object Tracking Using Direct Image Alignment and Photometric Bundle Adjustment, International Conference on Intelligent Robots and Systems (IROS), 2022.
video paper page

N. Peri, J. Luiten, M. Li, A. Ošep, L. Leal-Taixé, D. Ramanan: Forecasting from LiDAR via Future Object Detection, Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
code paper

M. Kolmet, Q. Zhou, A. Ošep, L. Leal-Taixé: Text2Pos: Text-to-point-cloud cross-modal localization, Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
code paper

Y. Liu, I. Zulfikar, J. Luiten, A. Dave, D. Ramanan, B. Leibe, A. Ošep, L. Leal-Taixé: Opening up Open-World Tracking, Conference on Computer Vision and Pattern Recognition (CVPR) (oral), 2022.
code paper

S. Agostinho, A. Ošep, A. Del Bue, L. Leal-Taixé: (Just) A Spoonful of Refinements Helps the Registration Error Go Down, International Conference on Computer Vision (ICCV) (oral), 2021.
code paper

M. Fabbri, G. Brasó, G. Maugeri, A. Ošep, R. Gasparini, O. Cetintas, S. Calderara, L. Leal-Taixé, R. Cucchiara: MOTSynth: How Can Synthetic Data Help Pedestrian Detection and Tracking?, International Conference on Computer Vision (ICCV), 2021.
video paper

M. Aygün, A. Ošep, M. Weber, M. Maximov, C. Stachniss, J. Behley, L. Leal-Taixé: 4D Panoptic LiDAR Segmentation, Conference on Computer Vision and Pattern Recognition (CVPR), 2021.
code poster video paper

A. Kim, A. Ošep, L. Leal-Taixé: EagerMOT: 3D Multi-Object Tracking via Sensor Fusion, IEEE International Conference on Robotics and Automation (ICRA), 2021.
code video paper

M. Weber, J. Xie, M. Collins, Y. Zhu, P. Voigtlaender, H. Adam, B. Green, A. Geiger, B. Leibe, D. Cremers, A. Ošep, L. Leal-Taixé, L. Chen: STEP: Segmenting and Tracking Every Pixel, NeurIPS Benchmarks and Datasets, 2022.
code paper

P. Dendorfer, A. Ošep, L. Leal-Taixé: Goal-GAN: Multimodal Trajectory Prediction Based on Goal Position Estimation, Asian Conference on Computer Vision (ACCV), 2020.
page code video paper

P. Dendorfer, A. Ošep, A. Milan, K. Schindler, D. Cremers, I. Reid, S. Roth, L. Leal-Taixé: MOTChallenge: A Benchmark for Single-camera Multiple Target Tracking, International Journal of Computer Vision (IJCV), 2020.
paper

J. Luiten, A. Ošep, P. Dendorfer, P. Torr, A. Geiger, L. Leal-Taixé, B. Leibe: HOTA: A Higher Order Metric for Evaluating Multi-Object Tracking, International Journal of Computer Vision (IJCV), 2020.
code blog paper

S. Mahadevan*, A. Athar*, A. Ošep, S. Hennen, L. Leal-Taixé, B. Leibe: Making a Case for 3D Convolutions for Object Segmentation in Videos, British Machine Vision Conference (BMVC), 2020.
code video paper

A. Athar*, S. Mahadevan*, A. Ošep, L. Leal-Taixé, B. Leibe: STEm-Seg: Spatio-temporal Embeddings for Instance Segmentation in Videos, European Conference on Computer Vision (ECCV), 2020.
code video paper

Y. Xu, A. Ošep, Y. Ban, R. Horaud, L. Leal-Taixé, X. Alameda-Pineda: How To Train Your Deep Multi-Object Tracker, Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
code video paper

J. Gross, A. Ošep, B. Leibe: AlignNet-3D for Fast Point Cloud Registration of Partially Observed Objects, International Conference on 3D Vision (3DV), 2019.
code video poster paper

P. Voigtlaender, M. Krause, A. Ošep, J. Luiten, B. Sekar, A. Geiger, B. Leibe: MOTS: Multi-Object Tracking and Segmentation, Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
code video paper

A. Ošep, P. Voigtlaender, M. Weber, J. Luiten, B. Leibe: 4D Generic Video Object Proposals, International Conference on Robotics and Automation (ICRA), 2020.
code teaser video paper

A. Ošep, P. Voigtlaender, J. Luiten, S. Breuers, B. Leibe: Large-Scale Object Mining for Object Discovery from Unlabeled Video, International Conference on Robotics and Automation (ICRA), 2019.
video paper

A. Ošep, W. Mehner, P. Voigtlaender, B. Leibe: Track, then Decide: Category-Agnostic Vision-based Multi-Object Tracking, International Conference on Robotics and Automation (ICRA), 2018.
code video paper

A. Ošep, P. Voigtlaender, J. Luiten, S. Breuers, B. Leibe: Towards Large-Scale Video Object Mining, ECCV 2018 Workshop on Interactive and Adaptive Learning in an Open World, 2018.
paper

A. Ošep, W. Mehner, M. Mathias, B. Leibe: Combined Image- and World-Space Tracking in Traffic Scenes, International Conference on Robotics and Automation (ICRA), 2017.
teaser code video paper

D. Klostermann, A. Ošep, J. Stueckler, B. Leibe: Unsupervised Learning of Shape-Motion Patterns for Objects in Urban Street Scenes, British Machine Vision Conference (BMVC), 2016.
video paper

D. Kochanov, A. Ošep, J. Stueckler, B. Leibe: Scene Flow Propagation for Semantic Mapping and Object Discovery in Dynamic Street Scenes, International Conference on Intelligent Robots and Systems (IROS), 2016.
video paper

A. Ošep, A. Hermans, F. Engelmann, D. Klostermann, M. Mathias, B. Leibe: Multi-Scale Object Candidates for Generic Object Tracking in Street Scenes, International Conference on Robotics and Automation (ICRA), 2016.
paper

D. Mitzel, J. Diesel, A. Ošep, U. Rafi, B. Leibe: A Fixed-Dimensional 3D Shape Representation for Matching Partially Observed Objects in Street Scenes, International Conference on Robotics and Automation (ICRA), 2015.
paper

M. Weinmann, A. Ošep, R. Ruiters, R. Klein: Multi-View Normal Field Integration for 3D Reconstruction of Mirroring Objects, International Conference on Computer Vision (ICCV), 2013.
paper

M. Weinmann, R. Ruiters, A. Ošep, C. Schwartz, R. Klein: Fusing Structured Light Consistency and Helmholtz Normals for 3D Reconstruction, British Machine Vision Conference (BMVC), 2012.
paper

Aljosa Osep, Ph.D.
