Hi, I'm Mohammad Ayaz Alam

Data Scientist & AI Engineer

Passionate about leveraging data and AI to solve complex problems. My expertise spans machine learning, generative AI, and full-stack data solutions. I thrive at the intersection of technology and research, delivering impactful insights and innovative solutions.

(+49) 017684051947
alam.ayaz47@gmail.com
Nuremberg, Germany

My Skills

I've cultivated a diverse set of technical skills through both academic studies and professional experience

Programming & Data Science

Python Pandas NumPy Matplotlib Seaborn Scikit-learn PyTorch Jupyter R

Generative AI & NLP

LLMs LangChain Fine-Tuning RAG Llamaindex Prompt Engineering Hugging Face

Databases & Big Data

SQL PostgreSQL Database Design Query Optimization MongoDB

Data Visualization & BI

Tableau Power BI Plotly Dash

Web Development & APIs

Django Flask FastAPI RESTful APIs HTML CSS JavaScript Bootstrap TailwindCSS

DevOps & Tools

Docker Git/GitHub CI/CD Pipelines Linux/Unix

Work Experience

My professional journey through research institutions and tech companies

Junior Data Scientist (NLP) — Master's Thesis

Siemens Energy

July 2025 - Present

Erlangen, Germany (Hybrid)

  • Integrated agentic AI into TMS for operator support: log traces, reports, fix plans (config diffs, restart/rollback), emails.
  • Implemented MCP server to orchestrate hierarchical multi-agent system (parent/child) with role-specific tools.
  • Built NLP components (retrieval, summarization, entity extraction) over TMS/knowledge bases for tools and copilots.

Data Consultant Intern

BASF

April 2025 - July 2025

Ludwigshafen am Rhein, Germany

  • Supported internal clients across business units by building and integrating Generative AI tools into daily workflows.
  • Developed a Data Validation AI Agent to analyze structured datasets, generate validation reports, compute quality scores, and run custom validation tests.
  • Led the end-to-end event app for Inhouse Consulting (submissions, voting, UI, analytics, and access controls).
  • Contributed to internal project management tools to streamline team operations and task tracking.

Research Assistant

Institute for Employment Research

Mar 2025 - Present

Nuremberg, Germany

  • Applied ML techniques (PU learning: Elkan & Noto, Two-Step Strategy, Bagging) for job market analysis.
  • Designed and optimized data pipelines for efficient processing, management, and feature engineering.
  • Conducted data analysis, visualization, and exploratory data analysis (EDA) to extract key labor market insights.
  • Engaged in research paper writing, proofreading, and technical documentation.

Machine Learning Researcher

Institute for Employment Research

Nov 2024 - Feb 2025

Nuremberg, Germany

  • Applied ML techniques (XGBoost, Random Forest, SVM, PU learning) to forecast labor market trends.
  • Built scalable data pipelines for preprocessing and feature engineering on 90M+ records.
  • Addressed class imbalances through weight estimation, improving model robustness and accuracy.
  • Developed predictive models to rank job offers, aiding caseworkers in pre-selecting opportunities.
  • Delivered insights on labor market dynamics through reports and stakeholder presentations.

Business Development Intern

Shape AI

Mar 2021 - Oct 2021

Delhi, India

  • Performed data analysis using Power BI, extracting actionable insights from sales data.
  • Developed dashboards and visualizations to clearly communicate key metrics.
  • Designed a data-backed course structure for the winter training program.
  • Contributed to marketing by analyzing target audience behavior.

Web Developer Intern

Hoda Engineering Works

Sept 2020 - Jan 2021

Delhi, India

  • Built responsive websites using HTML, CSS, Bootstrap, and JavaScript.
  • Performed testing and debugging on various devices to improve UX.
  • Optimized JavaScript and CSS for faster load times.
  • Adopted modern web frameworks and best practices.

Featured Projects

My hands-on projects that demonstrate practical applications of my skills

Fine-Tuned GPT-2 for Poetry Generation with RL

Poetry Generation with RL

AI/ML

Fine-tuned GPT-2 (124M) on Gutenberg poetry dataset using Reinforcement Learning to enhance stylistic quality.

GPT-2 RL Transformers

Key Features

  • Custom reward functions for rhyme, coherence, and creativity
  • Adaptive learning rates and policy gradients
  • Publicly available on HuggingFace
View Project
RAG Chatbot for Emulating Chat Tone

Persona Chatbot

NLP

RAG chatbot with Gemini Pro API for personalized conversation emulation from WhatsApp history.

RAG LangChain FAISS Streamlit

Key Features

  • Persona-based response system with consistent tone
  • Efficient similarity search with FAISS
  • Intuitive UI with real-time chat
View Project
Biller - Your Friendly Bill Management System

Biller

Full-Stack

Full-stack expense management system with Gemini AI for OCR-based receipt processing.

Streamlit Supabase PostgreSQL Plotly

Key Features

  • OCR-based receipt processing
  • Secure authentication with Supabase
  • Interactive analytics dashboard
View Project
Audio to Text Transcription App

Audio to Text Transcription App

AI/Speech

Streamlit-based application to transcribe audio files into text, supporting multiple languages.

Streamlit Transcription

Key Features

  • Multi-language support
  • Accurate transcription
  • User-friendly interface
View Project
Be-Healthier

Be-Healthier

Web App

A web-based health and fitness tracker built with Streamlit.

Health Fitness

Key Features

  • Health monitoring
  • Fitness tracking
  • User dashboard
View Project
Advanced WhatsApp Chat Analytics Platform

Advanced WhatsApp Chat Analytics Platform

Analytics

Analyzes WhatsApp chat data with interactive visualizations.

Visualization Data

Key Features

  • Interactive charts
  • Chat data analysis
  • Data export
View Project
Food Locha

Food Locha

AI

An AI-powered app that analyzes meal images, calculates calorie details, and provides nutritional insights.

Nutrition Image Analysis

Key Features

  • Meal image analysis
  • Calorie calculation
  • Nutritional insights
View Project
AI-Powered Image-to-Story Generator

AI-Powered Image-to-Story Generator

AI/Creative

Converts images into short stories using Gemini AI and gTTS.

Image Story

Key Features

  • Image-to-text conversion
  • Story generation
  • Gemini AI integration
View Project
Gemini ATS Resume Improvement Tool

Gemini ATS Resume Improvement Tool

AI/ATS

AI-powered app to evaluate resumes and generate cover letters based on job descriptions.

Resume ATS

Key Features

  • Resume evaluation
  • Cover letter generation
  • Job description matching
View Project
Interactive RAG-Based QA System

Interactive RAG-Based QA System

RAG/QA

A RAG system to analyze lecture slides with LlamaIndex and Llama-2-7b-chat-hf.

Q&A LlamaIndex

Key Features

  • Slide analysis
  • Interactive Q&A
  • LlamaIndex integration
View Project
Email/SMS Spam Classifier

Email/SMS Spam Classifier

NLP

ML model for classifying messages as spam or ham using NLP techniques.

NLP Classifier

Key Features

  • Spam detection
  • NLP-based classification
  • High accuracy
View Project
Emotion Detection System

Emotion Detection System

CNN/Deep Learning

CNN-based system to classify facial expressions into seven emotions with TensorFlow and Keras.

TensorFlow Keras

Key Features

  • Facial expression classification
  • Real-time detection
  • Seven emotion categories
View Project

My Education

MSc. Data Science

Friedrich-Alexander-Universität Erlangen-Nürnberg

Oct 2022 - Present

Erlangen, Germany

Courses: Business Analytics, ML Time Series, ML in Finance, Mathematics of Learning, Linked Data

Bachelors of Computer Applications

Guru Gobind Singh Indraprastha University

Aug 2018 - Sept 2021

Delhi, India

Courses: OOPs, Network Security, DBMS, Web Technologies, Data Warehousing and Data Mining

My Certificates

AI Agents Fundamentals

Huggingface - Agents fundamentals, Tools and Action, Agents Workflow

View Certification

Full Stack Data Analytics

iNeuron.ai - Completed August 2024

View Certification

Deep Learning Foundation

iNeuron - Neural Network, Backpropagation, CNN

View Certification

Intro to Machine Learning

Kaggle - Model Validation, Random Forest

View Certification

NLP Foundation

iNeuron - Completed February 2023

View Certification

ChatGPT Prompt Engineering

DeepLearning.ai - Developer-focused strategies

View Certification

Responsive Web Design

freeCodeCamp - Web development fundamentals

View Certification

Python Basic

Hackerrank - Core Python concepts

View Certification

Data Analysis with Python

freeCodeCamp - Numpy, Pandas, Data Cleaning

View Certification

Automate Boring Stuff with Python

Udemy - Practical Python automation

View Certification

Get In Touch

Feel free to reach out for collaborations, job opportunities, or just to say hi!

Contact Information

Phone

(+49) 017684051947

Email

alam.ayaz47@gmail.com

Location

Nuremberg, Germany

Connect

Send Me a Message