
Machine Learning Researcher
Building brains
Pushing the boundaries of AI at Siemens R&D. Expertise in developing and deploying enterprise-grade AI solutions, from multimodal foundation models and LLM agents to vision architectures, powered by distributed training pipelines and latency-sensitive inference engines.
Professional Experience
Machine Learning R&D
Siemens AG • Berlin
ML Researcher, AI R&D Division: Conduct research on deep learning architectures for computer-vision-based detection and semantic interpretation of electrical schematics and symbols. Lead a comparative analysis of state-of-the-art object detection frameworks (YOLO variants, Co-DETR, InternImage-H, Faster R-CNN), segmentation models (SAM 2), vision transformers (Dual Attention ViT, DINOv2), and multimodal architectures (Qwen2.5-VL), trained on distributed compute clusters in Azure ML Studio and achieving >96% mean average precision across multiple model configurations and test datasets.
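For a flavor of what such a benchmark involves, here is a minimal sketch of a model-comparison loop built on torchmetrics' COCO-style mAP metric; the candidates dict and test_loader are illustrative placeholders, not the actual Siemens pipeline.

# Sketch: comparing detectors on COCO-style mAP with torchmetrics.
# `candidates` and `test_loader` are placeholders for illustration only.
import torch
from torchmetrics.detection.mean_ap import MeanAveragePrecision

def evaluate(model, data_loader, device="cuda"):
    """Return COCO-style mAP (IoU 0.5:0.95) for one detector."""
    metric = MeanAveragePrecision(iou_type="bbox")
    model.eval().to(device)
    with torch.no_grad():
        for images, targets in data_loader:
            images = [img.to(device) for img in images]
            # Torchvision-style detectors return one dict per image with
            # "boxes", "scores", and "labels" tensors.
            preds = model(images)
            metric.update(
                [{k: v.cpu() for k, v in p.items()} for p in preds],
                targets,
            )
    return metric.compute()["map"].item()

# Rank several candidate architectures on the same held-out test set.
results = {name: evaluate(model, test_loader) for name, model in candidates.items()}
print(sorted(results.items(), key=lambda kv: kv[1], reverse=True))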
Machine Learning - TDI Division
Deutsche Bank • Berlin
Featured Work
Explore my latest work
A collection of my projects, research paper explanations, and machine learning writing.
Research Papers
Latest Paper Explanations
Deep dives into the most influential research papers in LLMs, computer vision, and agentic models.
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI

DeepSeek-R1 applies large-scale reinforcement learning, with only a light supervised cold start, to incentivize chain-of-thought reasoning, reaching performance comparable to OpenAI o1 on math, code, and reasoning benchmarks...
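As a pointer to the mechanics, here is a tiny sketch of the group-relative advantage estimate at the heart of GRPO, the reinforcement learning algorithm the paper builds on; the reward values are made up.

# Sketch of GRPO-style group-relative advantages: sample a group of
# responses per prompt, then normalize each reward against group statistics.
import torch

def group_relative_advantages(rewards: torch.Tensor, eps: float = 1e-8) -> torch.Tensor:
    """rewards: (num_prompts, samples_per_prompt) scalar rewards per response."""
    mean = rewards.mean(dim=1, keepdim=True)
    std = rewards.std(dim=1, keepdim=True)
    return (rewards - mean) / (std + eps)

# Toy example: two prompts, four sampled answers each (rewards are made up).
rewards = torch.tensor([[1.0, 0.0, 0.0, 1.0],
                        [0.0, 0.0, 0.0, 1.0]])
print(group_relative_advantages(rewards))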
DeepFace: Closing the Gap to Human-Level Performance in Face Verification
Taigman, Yang, Ranzato, Wolf

DeepFace introduces a nine-layer deep neural network architecture that achieves an accuracy of 97.35% on the Labeled Faces in the Wild (LFW) dataset...
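The verification step that DeepFace-style systems share is easy to sketch: embed two aligned face crops with a deep network and threshold a similarity score. The embed network and threshold below are placeholders, not the paper's exact pipeline.

# Sketch of embedding-based face verification. `embed` is a placeholder
# for any trained face-embedding network; the threshold is illustrative.
import torch
import torch.nn.functional as F

def verify(embed, face_a, face_b, threshold=0.6):
    """Return True if two face crops (batch of 1) match the same identity."""
    with torch.no_grad():
        # L2-normalize so the dot product is cosine similarity in [-1, 1].
        za = F.normalize(embed(face_a), dim=-1)
        zb = F.normalize(embed(face_b), dim=-1)
    return (za * zb).sum(dim=-1).item() > threshold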
Cognitive Architectures for Language Agents
Sumers, Yao, Narasimhan, Griffiths

This paper presents a systematic approach to building language agents with cognitive architectures, exploring the intersection of language models and decision-making systems...
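Below is a minimal sketch of the decision loop CoALA organizes agents around, with working memory for the current episode and long-term memory the agent retrieves from and writes to; the llm callable and the toy retrieval rule are placeholders.

# Sketch of a CoALA-style perceive -> retrieve -> act loop.
# The `llm` callable and the retrieval rule are placeholders.
from dataclasses import dataclass, field
from typing import Callable, List

@dataclass
class Agent:
    llm: Callable[[str], str]                          # prompt -> completion
    episodic: List[str] = field(default_factory=list)  # long-term memory
    working: List[str] = field(default_factory=list)   # current episode

    def step(self, observation: str) -> str:
        self.working.append(f"observation: {observation}")
        recalled = self.episodic[-3:]  # toy retrieval: last three episodes
        action = self.llm("\n".join(recalled + self.working + ["next action:"]))
        self.episodic.append(f"{observation} -> {action}")  # learning: store the step
        return action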
Distillation Scaling Laws
Busbridge, Shidani, Webb, Littwin

This study reveals fundamental patterns in how knowledge distillation effectiveness scales with model size, data quantity, and architectural choices...
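The objective whose scaling behavior the study maps is the classic distillation loss; a sketch, assuming the usual temperature-scaled KL term mixed with hard-label cross-entropy.

# Sketch of the standard knowledge-distillation objective: a
# temperature-softened KL term between teacher and student logits,
# blended with ordinary cross-entropy on the labels.
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)  # rescale so gradients match the hard-label term
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard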
BMW Agents: A Framework For Task Automation Through Multi-Agent Collaboration
Crawford, Duffy, Evazzade, Foehr

This framework introduces innovative approaches to multi-agent collaboration, enabling complex task automation through distributed intelligence and coordinated decision-making...
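As a generic sketch of the pattern such frameworks build on (not BMW Agents' actual API): one agent decomposes the task, and specialist agents solve the pieces.

# Generic coordinator/worker sketch. The decompose function and the
# specialist agents are placeholders, not the paper's interfaces.
from typing import Callable, Dict, List, Tuple

def run_task(
    task: str,
    decompose: Callable[[str], List[Tuple[str, str]]],  # task -> [(subtask, role)]
    specialists: Dict[str, Callable[[str], str]],       # role -> solver agent
) -> List[str]:
    return [specialists[role](subtask) for subtask, role in decompose(task)]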
Apple Intelligence Foundation Language Models
Various Authors

Apple's approach to developing and deploying foundation models focuses on privacy-preserving techniques and efficient on-device inference...
YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
Wang, C.-Y., Yeh, I.-H., Liao, H.-Y. M.

YOLOv9 introduces programmable gradient information (PGI) and the GELAN architecture to counter information loss in deep networks, enabling more effective feature learning and state-of-the-art real-time detection accuracy...
YOLOv12: Attention-Centric Real-Time Object Detectors
Tian, Y., Ye, Q., Doermann, D.

YOLOv12 incorporates novel attention mechanisms to enhance feature representation while maintaining the speed advantage of the YOLO architecture...
SAM 2: Segment Anything in Images and Videos
Ravi, N., Gabeur, V., Hu, Y.-T., et al.

SAM 2 extends promptable segmentation from images to video, adding a streaming memory that tracks objects across frames while improving accuracy and speed over its predecessor on still images...
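A hedged usage sketch with the sam2 package (github.com/facebookresearch/sam2), following its video-predictor README; treat the config and checkpoint paths, and the exact signatures, as assumptions to verify against the current release.

# Sketch of promptable video segmentation with the sam2 video predictor.
# Paths and signatures follow the repo's README; verify before use.
import numpy as np
import torch
from sam2.build_sam import build_sam2_video_predictor

predictor = build_sam2_video_predictor("configs/sam2.1/sam2.1_hiera_l.yaml",
                                       "checkpoints/sam2.1_hiera_large.pt")
with torch.inference_mode():
    # The repo expects a directory of JPEG frames extracted from the clip.
    state = predictor.init_state(video_path="./video_frames")
    # One positive click on the target object in frame 0 seeds the track.
    predictor.add_new_points_or_box(
        state, frame_idx=0, obj_id=1,
        points=np.array([[420, 280]], dtype=np.float32),
        labels=np.array([1], dtype=np.int32),
    )
    # The streaming memory then propagates the mask through the clip.
    for frame_idx, obj_ids, mask_logits in predictor.propagate_in_video(state):
        masks = (mask_logits > 0.0).cpu()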
Transformer-Squared: Self-Adaptive LLMs
Sun, Q., Cetin, E., Tang, Y.

Transformer-Squared introduces a self-adaptation framework in which a language model adjusts only the singular values of its weight matrices at inference time, composing pre-trained task-specific "expert" vectors instead of running explicit fine-tuning...
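The underlying adaptation primitive, singular value fine-tuning (SVF), is compact enough to sketch: freeze a weight's singular vectors and learn only a vector z that rescales its singular values.

# Sketch of singular value fine-tuning: W' = U diag(s * z) V^T,
# where only the scaling vector z is trained per task.
import torch

def svf_adapt(W: torch.Tensor, z: torch.Tensor) -> torch.Tensor:
    """Rescale W's singular values by a learned vector z."""
    U, s, Vh = torch.linalg.svd(W, full_matrices=False)
    return U @ torch.diag(s * z) @ Vh

W = torch.randn(64, 64)                  # a frozen pre-trained weight
z = torch.ones(64, requires_grad=True)   # trained per task "expert"
W_task = svf_adapt(W, z)

Because only z is trained, each task expert is tiny, and experts can be mixed or swapped at inference time, which is what gives the method its self-adaptive character.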