Career Profile

PhD in Computer Vission & Machine Learning (TU Berlin), with a focus on context-aware facial expression recognition using multi-modal deep learning. Experienced in interdisciplinary research, project management and team leadership. Seeking a posiLon as a machine learning research scientist or engineer.

Work Experience

Ph.D. Researcher

03/2020 - 01/2025
Technical University of Berlin
  • Dissertation: “Knowledge-Augmented and Context-Sensitive Face Perception” within interdisciplinary excellence cluster Science of Intelligence
  • Goal: Develop multi-modal deep-learning models for context-sensiGve facial expression recognition, integrating vision, audio, and language.
  • Developed a theoretical framework for the topic from a joint psychological – ML viewpoint, combined with a literature analysis (Consciousness and Cognition ‘22).
  • Developed a multi-task multi-modal self-supervised learning (SSL) approach to shape representation space. Combined vision models with an audio model and an LLM, improved accuracy by more than 5% over other SSL approaches that use the same modalities (CVPR’24).
  • Fused generative AI models with a self-attention and a classification module to classify and simultaneously generate context-augmented facial expressions. Generations were verified as approximations of mental representations in a study with human participants (GCPR’24).

Research Assistant

05/2019 - 09/2019
Max Planck institute für molecular cell biology, Dresden
  • Master’s thesis: Implemented and compared four reinforcement learning approaches for improving denoising biological microscopy images of cells
  • Capstone project: Developed a 6D pose annota6on tool and CNNs for surgical instrument localizaGon in laparoscopic videos

Student Employee

02/2017 - 12/2017
Monkey Works GmbH, Dresden

Development of an application to create mobile machine control apps.

Research Assistant

03/2017 - 09/2017
Computer lab of the faculty of law, Dresden

Assisting in running the computer lab.

Student Employee

09/2015 - 12/2016
exedio GmbH, Dresden

Java-based backend development.

Student Employee

03/2014 - 08/2015
Konsole Labs GmbH, Dresden

Development of iOS radio apps.

Education

PhD in computer Science

2020 - 2025
Technical University of Berlin

Magna cum laude Focus on

  • Facial expression recognition in context
  • Computer vision
  • Interdisciplinary work with psychologists

Erasmus Semester

2013 - 2014
University of Sheffield

Diploma in Computer Science

2012 - 2019
Technical University of Dresden

Grade: very good (1.4) Focus on

  • Machine learning
  • Computer vision
  • Theoretical computer science

BSc in Computer Science

2011 - 2012
Karlsruhe Institute of Technology

Publications

I was first (or shared-first) author of the following publications:

  • How Do You Perceive My Face? Recognizing Facial Expressions in Multi-Modal Context by Modeling Mental Representations
  • Florian Blume, Runfeng Qu, Pia Bideau, Martin Maier, Rasha Abdel Rahman, Olaf Hellwich
    Accepted to GCPR 2024
  • Multi-Task Multi-Modal Self-Supervised Learning for Facial Expression Recognition
  • Marah Halawa, Florian Blume, Pia Bideau, Martin Maier, Rasha Abdel Rahman, Olaf Hellwich
    Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024, pp. 4604-4614
  • Knowledge-augmented face perception: Prospects for the Bayesian brain-framework to align AI and human vision
  • Martin Maier, Florian Blume, Pia Bideau, Olaf Hellwich, Rasha Abdel Rahman
    Consciousness and Cognition, Volume 101, 2022, ISSN 1053-8100

    Projects

    Some of my open source projects on GitHub:

    6D-PAT - With 6D-PAT you can create 6D annotations on images for 6D pose estimation, i.e. annotate 2D images with the 3D rotation and 3D translation of 3D models.
    ConCluGen - Code for our paper "Multi-Task Multi-Modal Self-Supervised Learning for Facial Expression Recognition"
    Qt3D Offscreen Renderer - Implementation of an offscreen renderer for the Qt3D framework.
    Qt3D-Widget - Implementation of a Qt widget that allows to embed Qt3D.
    Qt3D Gizmo - A 3D gizmo implemented for the Qt3D framework.

    Awards

    Deutschlandstipendium

    2015 - 2016

    Receiver of the Deutschlandstipendium scholarship for talented students for one year.

    Skills & Proficiency

    Python

    PyTorch

    PyTorch Lightning

    CometML

    MLFlow

    Research & Analysis

    Project Managmeent

    Scientific Writing

    C++

    Qt

    TensorFlow

    Java

    Illustrator & Inkscape

    HTML5 & CSS