Python Backend & AI Systems Engineer

LLM Integration · RAG Architectures · Vector Search · High-Performance Backend Systems

I architect and build production-grade AI systems powered by LLMs, vector search, and high-throughput async pipelines. My focus is scalable backend infrastructure, reliability, and measurable performance.

Remote-first · Open to EU & US opportunities

Engineering Profile

AI-focused backend engineer building scalable LLM-powered systems, vector retrieval pipelines, and high-performance data platforms.

I treat systems as long-term infrastructure assets - prioritizing reliability, observability, and performance. My work spans architecture design, RAG pipeline implementation, async processing optimization, API-first development, and containerized deployment.

Strong focus on performance-aware backend design and failure-resilient systems.

What I Deliver

Production-ready LLM integration into backend systems

Scalable data & RAG pipelines powered by vector search

Async & concurrent architectures with measurable performance gains

Fault-tolerant web data acquisition systems (<1% failure rate)

API-first backend services ready for product integration

From architecture to deployment - full lifecycle ownership

Tech Stack

Backend & Core

Python · FastAPI · PostgreSQL · MongoDB · REST APIs

AI & Data

LLM Integration · RAG Pipelines · pgvector · Embeddings · Prompt Engineering

Performance & Scalability

Asyncio · Multiprocessing · Concurrency Patterns · High-Load Backend Design

Infrastructure

Docker · Linux · Git · Containerized Deployment

Web Data Acquisition

Playwright · Selenium · Proxy Rotation · Anti-bot Strategies

AI System Architecture

Typical production-grade AI backend architecture combining async ingestion, vector search, and LLM-powered retrieval.

1. Data Ingestion

Async pipelines · API ingestion · Structured data validation

2. Embedding & Indexing

OpenAI embeddings · pgvector · Indexed similarity search

3. Retrieval Layer

Semantic search · Context assembly · Scored ranking

4. LLM Processing

RAG pipeline · Prompt engineering · Controlled generation

5. API Delivery

FastAPI backend · JSON response · Containerized deployment

Selected Projects

Retrieval-Augmented AI System (RAG + Vector Search)

Designed and implemented a production-ready AI-driven retrieval system combining structured data ingestion, vector embeddings, and semantic search. The solution enables efficient similarity-based retrieval and serves as a foundation for retrieval-augmented generation (RAG) workflows.

View Details

High-Load Protected Data Acquisition & API Delivery System

Designed and implemented a resilient high-volume data acquisition system with a FastAPI-based backend layer for structured data delivery. The system operated under advanced anti-bot constraints while exposing clean, programmatic access to normalized datasets.

View Details

Scalable Data Aggregation & Normalization Engine

Architected a high-throughput data ingestion system aggregating structured records from multiple dynamic platforms. The system ensures consistent normalization, incremental updates, and reliable downstream delivery.

View Details

Engineering Impact

Built multiple production-grade AI & backend systems

Designed AI-ready data architectures supporting semantic search and intelligent retrieval

Optimized processing throughput by up to 8×

Maintained <1% failure rate in large-scale extraction pipelines

Delivered full-cycle systems from architecture to deployment

Designed scalable data systems processing tens of thousands of records

Professional Certificates

Let's Build Scalable AI Systems

Open to remote roles across EU & US markets.

AI Backend Engineer · LLM Systems Engineer · RAG & Vector Search Architect