Portrait of Nigo

Machine Learning Researcher

Building brains

Pushing the boundaries of AI at Siemens R&D. Expertise in developing and deploying enterprise-grade AI solutions—from multimodal foundation models and LLM agents to vision architectures—powered by distributed training pipelines and latency-sensitive inference engines.

Academic & Corporate Experience

Technical University of Munich
Deutsche Bank
Siemens
Friedrich-Alexander-Universitu00e4t
Humboldt University of Berlin

Professional Experience

Corporate Experience

Siemens AG

Machine Learning R&D

Siemens AGBerlin

Jul 2023 – present

ML Researcher, AI R&D Division: Conduct research on deep learning architectures for computer vision-based detection and semantic interpretation of electrical schematics and symbols. Lead comparative analysis of state-of-the-art object detection frameworks (YOLO variants, Co-DETR, InternImage-H, Faster R-CNN), segmentation models (SAM 2), vision transformers (Dual Attention ViT, DINOv2), and multimodal architectures (Qwen2.5-VL) powered by distributed training compute clusters on Azure ML Studio, achieving >96% mean average precision across multiple model configurations and test datasets.

Deutsche Bank

Machine Learning - TDI Division

Deutsche BankBerlin

Jan 2023 – Jun 2023

Developed a multi-agent forecasting framework integrating real-time news sentiment with historical market data for holistic market prediction
Experimented with volatility forecasting models, including Hybrid GARCH-CNN-LSTM and CNN-BiLSTM-Attention architectures, to capture complex nonlinear and long-range temporal dependencies
Implemented channel-independent Patch Time Series Transformers to model non-stationary patterns and cross-channel relationships in asset returns
Integrated DLinear models for efficient, low-latency inference in high-frequency trading environments

Featured Work

Explore my latest work

A collection of my projects, research papers, and machine learning expertise.

Visualization for BMW Agents - A Framework For Task Automation Through Multi-Agent Collaboration

BMW Agents - A Framework For Task Automation Through Multi-Agent Collaboration

Implementing hierarchical task decomposition with deterministic DAG-based execution. Features bi-directional agent communication, vector-embedded episodic memory, semantic toolbox refinement, and configurable prompt strategies.

View Project
Visualization for Memory-augmented Agentic Information Retrieval

Memory-augmented Agentic Information Retrieval

Implementation of Zhang et al. Agentic IR paradigm with a memory-augmented agent architecture. Features stateful information transitions, thought generation, policy learning, and tool integration powered by local LLMs.

View Project
Visualization for Vertical Agents Implementation

Vertical Agents Implementation

Agentic system with BaseMemory, ShortTermMemory, LongTermMemory, VectorMemory. Human-Augmented Agents and RAG Router for knowledge management. Vector Embeddings for semantic search and In-Memory Vector Database.

View Project
Visualization for Transformer-based News Summarization

Transformer-based News Summarization

Advanced NLP project using BART transformer for news summarization. Achieved loss reduction from 1.5276 to 0.1102, with high ROUGE scores (rouge1: 0.7753, rouge2: 0.6970). Integrated with Weights & Biases and Hugging Face Hub.

View Project
Visualization for Graph Neural Networks Classification

Graph Neural Networks Classification

GCN implementation for link prediction on Cora dataset, achieving 87.89% test accuracy. Optimized train-validation-test splits and implemented early stopping with cross-entropy loss evaluation.

View Project
Visualization for Physics Informed Neural Networks

Physics Informed Neural Networks

PINN implementation in PyTorch for 1D harmonic oscillators, combining data fidelity and physical law compliance in the loss function. Includes analytical solution integration and training visualizations.

View Project

Research Papers

Latest Paper Explanations

Deep dives into the most influential research papers in LLMs, computer vision and agentic models.

2024DeepSeek AI Research

DeepSeek-R1: A Robust and Responsible Language Model

Various Authors

Visualization for DeepSeek-R1: A Robust and Responsible Language Model
DeepSeek AI Research

DeepSeek-R1 introduces novel training techniques and architectural improvements to create a more reliable and controllable language model...

2014Facebook AI Research • Tel Aviv University

DeepFace: Closing the Gap to Human-Level Performance in Face Verification

Taigman, Yang, Ranzato, Wolf

Visualization for DeepFace: Closing the Gap to Human-Level Performance in Face Verification
Facebook AI Research

DeepFace introduces a nine-layer deep neural network architecture that achieves an accuracy of 97.35% on the Labeled Faces in the Wild (LFW) dataset...

2024Princeton University

Cognitive Architectures for Language Agents

Sumers, Yao, Narasimhan, Griffiths

Visualization for Cognitive Architectures for Language Agents
Princeton University

This paper presents a systematic approach to building language agents with cognitive architectures, exploring the intersection of language models and decision-making systems...

2024Apple • University of Oxford

Distillation Scaling Laws

Busbridge, Shidani, Webb, Littwin

Visualization for Distillation Scaling Laws
Apple

This study reveals fundamental patterns in how knowledge distillation effectiveness scales with model size, data quantity, and architectural choices...

2024BMW Research

BMW Agents: A Framework For Task Automation Through Multi-Agent Collaboration

Crawford, duffy, Evazzade, Foehr

Visualization for BMW Agents: A Framework For Task Automation Through Multi-Agent Collaboration
BMW Research

This framework introduces innovative approaches to multi-agent collaboration, enabling complex task automation through distributed intelligence and coordinated decision-making...

2024Apple Research

Apple Intelligence Foundation Language Models

Various Authors

Visualization for Apple Intelligence Foundation Language Models
Apple Research

Apple's approach to developing and deploying foundation models focuses on privacy-preserving techniques and efficient on-device inference...

2024Shanghai AI Lab • PJLAB

YOLOv9: Learning Using Programmable Gradient Information

Wang, C., Lyu, S., Zhou, X., et al.

Visualization for YOLOv9: Learning Using Programmable Gradient Information
Shanghai AI Lab

YOLOv9 introduces revolutionary techniques for gradient manipulation during training, enabling more effective feature learning and state-of-the-art detection accuracy...

2024Shanghai AI Lab • CUHK

YOLOv12: Attention-Centric Real-Time Object Detection

Wang, C., Ren, Y., Lyu, S., et al.

Visualization for YOLOv12: Attention-Centric Real-Time Object Detection
Shanghai AI Lab

YOLOv12 incorporates novel attention mechanisms to enhance feature representation while maintaining the speed advantage of the YOLO architecture...

2024Meta AI Research • FAIR

SAM 2: Segment Anything in Images and Videos

Kirillov, A., Mintun, E., Ravi, N., et al.

Visualization for SAM 2: Segment Anything in Images and Videos
Meta AI Research

SAM 2 introduces significant improvements over its predecessor, including better boundary precision, enhanced zero-shot capabilities, and extended functionality for video segmentation...

2024Meta AI Research • FAIR

Transformer-Squared: Self-Adaptive LLMs

Ranzato, M., Touvron, H., Grave, E., et al.

Visualization for Transformer-Squared: Self-Adaptive LLMs
Meta AI Research

Transformer-Squared introduces a meta-learning approach for language models, allowing them to reconfigure their parameters on-the-fly for specific tasks without explicit fine-tuning...