Machine Learning Engineer / AI Research Engineer / Ph.D. Researcher

Lukáš Jochymek

I build transformer-based systems, LLM applications, and data-heavy ML platforms, then spend a healthy amount of time making them faster, cheaper, and less dramatic in production.

Most of my work lives somewhere between applied research, backend engineering, cloud infrastructure, and asking PyTorch to please behave on multi-GPU workloads. I am currently focused on production ML at Rankacy AI while continuing Ph.D. research in bioinformatics and computational biology.

  • AI/ML & Cloud Engineer at Rankacy AI since 2024
  • Former Machine Learning Engineer at NXP Semiconductors
  • Ph.D. in Bioinformatics & Computational Biology at VSB-TUO
  • Lecturer for AI/ML and Python courses

What I do

Research-minded engineering with a strong shipping instinct.

I like work that survives both curiosity and production traffic.

Production ML systems

Transformer models, LLM assistants, fine-tuned workflows, backend services, and the unglamorous optimization work that makes models usable outside demos.

PyTorch Transformers LLMs Quantization

Data and platform

TB-scale pipelines with Python, Spark, Delta Lake, RabbitMQ, and cloud-native deployment on AWS and Kubernetes with proper observability and CI/CD.

Spark Delta Lake AWS Kubernetes

Research that stays practical

Behavioral modeling, explainability, hyperbolic ML, biomedical AI, and model inspection tooling for answering why the model did something suspicious.

Explainability Hyperbolic ML Vision-Language Models Bioinformatics

Selected Work

The recent chapter.

Text-first on purpose. The projects are newer than the old screenshots anyway.

Applied LLM work Rankacy AI

LLM commentary and assistants

Prototyped automated CS2 commentary with gameplay events, LLMs, and TTS, while also building task-specific assistants and internal tools around model usage.

  • Evaluated multiple LLM architectures for latency, quality, and infrastructure cost.
  • Built internal visualization tools for model inspection, debugging, and explainability-oriented analysis.
Cloud and data Rankacy AI

TB-scale pipelines and infrastructure

Architected large-scale data and platform systems with Python, RabbitMQ, Spark, Delta Lake, AWS Glue, Terraform, ArgoCD, GitHub Actions, Karpenter, and KEDA.

  • Optimized infrastructure cost with spot instances and event-driven autoscaling.
  • Handled messy real-world issues, including Delta Lake implementation bugs and hardware instability.
NXP Semiconductors 2021 - 2024

Embedded AI on constrained hardware

Built embedded AI workflows for NXP S32K microcontrollers, from dataset preparation and augmentation to quantization, benchmarking, and C/C++ deployment.

  • Worked with TensorFlow, PyTorch, ONNX, model compression, and low-level debugging constraints.
  • Contributed to automotive touch-screen R&D projects under NDA.

Foundations

ML is the main plot, but not the whole backstory.

The older projects still matter because they taught me how to build, debug, and actually finish things.

Apps and algorithms

Desktop and systems work in C/C++, C#, and Python, including SDL experiments, a space shooter, maze generation and solving, and the usual university algorithm detours.

Low-level and data tooling

Comfortable with SQL, PL/SQL, T-SQL, CUDA, assembly, Git, Linux, and the kind of debugging that politely reminds you abstractions are optional.

Experience

The short version, split into industry and academia.

Professional

Apr 2024 - Present

AI/ML & Cloud Engineer · Rankacy AI

Own work across transformer modeling, LLM features, cloud infrastructure, data pipelines, backend refactors, and production ML systems.

Dec 2022 - Apr 2024

Machine Learning Engineer · NXP Semiconductors

Developed embedded AI systems for constrained hardware, including model benchmarking, optimization, and deployment in C/C++ environments.

Nov 2021 - Dec 2022

Machine Learning Intern · NXP Semiconductors

Worked on embedded computer vision tasks, ONNX/TensorFlow workflows, and low-level debugging around storage and deployment constraints.

Academic

2024 - Present

Ph.D. in Bioinformatics & Computational Biology · VSB-TUO

Researching hyperbolic machine learning and biomedical applications, while teaching AI/ML and Python courses and supervising bachelor theses.

2022 - 2024

MSc. in Computer Science (Data Analysis)

Dean's Award thesis: X-Ray Image Analysis and Processing.

Research

Questions I keep coming back to.

Human behavior, explainability, representation learning, and practical systems for running those ideas in the real world.

Current research interests

  • Behavioral modeling from gameplay data and inference of human decision-making patterns
  • Explainability and interpretability for deep learning models
  • Human-like AI agents and realistic aiming behavior in game environments
  • Hyperbolic machine learning for biological and biomedical data
  • Vision-language models in medicine and neuroscience-inspired questions around cognition and behavior

Publications and teaching

  • CLaRA: Cost-Effective LLM Function Calling Based on a Vector Database, MENDEL Journal, 2024
  • Additional accepted publication work, including research focused on hyperbolic machine learning
  • Lecturer for AI/ML and Python courses, plus bachelor thesis supervision and ML consulting

Core stack

PyTorch TensorFlow Transformers LangChain Python SQL Polars Pandas Spark Delta Lake RabbitMQ Redis AWS Docker Kubernetes Terraform ArgoCD GitHub Actions

About

Still curious about the same things, just with more GPUs around.

Outside work

I still enjoy algorithms, longboarding, black holes, psychology, neuroscience, and music with probably more emotional commitment than is strictly necessary.

How I like to work

Research depth when needed, production ownership when it counts, and enough range to move between model code, infrastructure, data work, and debugging without getting precious about it.

Contact

If you want to talk about applied ML, research, weird product ideas, or something interesting you are building, the links below are the fastest way to find me.

Languages: Czech (native), English (C1), German (A2), Polish (A2)