Malek Senoussi — ML Engineer & PhD

The thread

Three years teaching models to classify cells they'd never seen. The same muscle — rigorous evaluation, principled uncertainty, systems that survive messy real data — is what makes LLM agents useful in production.

01Core expertise

Classification at scale

High-dim data (20K+ features), weakly-supervised and hierarchical learning. Partial labels, novel-class discovery, single-cell RNA-seq.

LLM systems & agents

RAG, prompt engineering, RL loops for agent training. Instrumented evaluation, reward-hacking prevention, reliability-focused design.

Production & MLOps

Docker, CI/CD, experiment tracking, Streamlit dashboards, model monitoring. Research systems built to be reproducible and deployed — internal tools serving researcher workflows.

Core ML

Python PyTorch Scikit-learn Docker SQL Git Bash

LLM, evaluation & infra

LangChain LangGraph Claude Sonnet MLflow Streamlit AWS SLURM

02Selected work

Featured · latest

KAN RL Environment — LLM agent for scientific law discovery

An LLM agent trained via a reinforcement learning loop to autonomously fit Kolmogorov-Arnold Networks for symbolic regression. Structured feedback on generalization, OOD extrapolation, and expression parsimony — with no gradient flowing through the LLM.

Best score (GPT-OSS 120B)

0.694

Threshold 0.65 reached at

round 5 / 20

→ Evaluation loop architecture for autonomous agent training. A building block for research systems where LLMs train specialized models without human supervision.

Python RL KAN LLM agents Symbolic regression Docker

View on GitHub →

Featured

Board Game Generator — Multi-agent pipeline

A LangGraph pipeline orchestrating 5 specialized Claude Sonnet agents that generate fully playable browser games from a single text prompt. Each agent handles a distinct stage — game design, code generation, SVG assets, QA validation, documentation — with an automatic retry loop when the Tester detects blocking bugs.

→ 3 games generated end-to-end. €10 in API costs. Zero manual fixes on pipeline v2 vs 5 on v1 — a measurable robustness improvement across iterations.

Games generated

3

Total API cost

€10

→ Multi-agent orchestration with a stateful LangGraph graph, structured output, retry loops, JS syntax validation, and full CI/CD. Games are live and playable.

Python LangGraph Claude Sonnet Multi-agent Prompt engineering GitHub Actions

View on GitHub → Live demo →

Featured

LLM Diagnostic Framework

Systematic framework for diagnosing LLM failures and testing optimization strategies in production. Case study on medical entity extraction; modular and extensible to other use cases.

→ Reduced inference cost by 10× while maintaining accuracy (91% vs 92%) on medical entity extraction.

Pick the right model for the right complexity and cost.

Python LLMs Prompt engineering Medical NLP Docker

View on GitHub →

Featured

High-dimensional classification pipeline

Deep learning models for biological data with 20K+ features. Custom PyTorch architectures with automated feature selection and dimensionality reduction, achieving state-of-the-art performance on complex scRNA-seq classification.

→ State-of-the-art performance on real scRNA-seq datasets. Benchmarked against existing methods in peer-reviewed work (see Publications).

Python PyTorch Deep learning Bioinformatics scRNA-seq

Associated publications →

Energy forecasting system

End-to-end ML solution for energy demand forecasting with a Streamlit dashboard for real-time monitoring and decision-making.

→ Full pipeline from data ingestion to deployed dashboard.

Scikit-learn Streamlit Time series

View project →

03Publications

Partial label learning for automated classification of single-cell transcriptomic profiles
PLOS Computational Biology · 2024

Machine learning models for classifying single-cell RNA sequencing data under partial label learning. New methods benchmarked against adapted existing approaches on real and synthetic datasets.
Hierarchical novel class discovery for single-cell transcriptomic profiles
Preprint arXiv · 2024

Extreme learning scenario where models must classify without label knowledge. A hierarchical hypothesis on labels completes the learning schema for predictions on unlabelled data.
Random walk informed heterogeneity detection reveals how the lymph node conduit network influences T cells collective exploration behavior
PLOS Computational Biology · 2023

Contributed to an interactive Plotly/Dash visualization tool for random-walk simulations on large-scale networks.
Classification hiérarchique pour des données transcriptomiques faiblement supervisées
Conférence sur l'Apprentissage automatique · 2022

Extension of hierarchical classification to the weakly-supervised problem. Three algorithms benchmarked on C. elegans transcriptomic profiles.

04Education

PhD in Mathematics and Computer Science

Aix-Marseille University · 2024

Classification of single-cell RNA sequencing
Master 2 Mathematics and Applications (CEPS)

Aix-Marseille University · 2020

05What I'm looking for

ML engineering roles bridging research and production

Focus areas: LLM evaluation, autonomous agents, applied biological ML

Research-focused startups or deeptech scale-ups

Europe-based (Switzerland, France, Netherlands, UK) or remote