Lead Backend Engineer (AI Integration & Real-Time Systems)

Closed

Backend Backend FastAPI Python

Icon Location Location
Ho Chi Minh

Benefits

Laptop/desktop for works Laptop/desktop for works
Other benefits Other benefits
Flexible working time Flexible working time
Work-from-home Work-from-home

Job Overview And Responsibility

Role Overview We are seeking a highly experienced Lead Backend Engineer to architect and implement the core backend systems for Felix AI – an innovative EdTech platform leveraging AI to deliver real-time, personalized tutoring. This role is central to building scalable, secure, low-latency systems on Google Cloud Platform, integrating advanced AI services (Gemini, Vertex AI, ADK), and supporting real-time interactions via WebSockets and audio streaming. Key Responsibilities ●​ Design, implement, and scale backend services on GCP with a focus on performance, resilience, observability, and low-latency for real-time communication. ●​ Develop RESTful APIs using FastAPI (OpenAPI 3.0 compliant) and efficient Firestore data models. ●​ Integrate and orchestrate ADK-based pedagogical agents on Vertex AI, define toolkits (FunctionTools, BaseTools), and manage agent context/state. ●​ Implement Gemini Pro/Flash/Live APIs, including streaming logic, configuration handling, and prompt engineering for optimal AI performance. ●​ Architect and operate a real-time communication layer over WebSockets (Python) with secure, performant audio (PCM) streaming. ●​ Manage distributed session state using Redis (Memorystore). ●​ Collaborate with iOS engineer for API/WebSocket contracts and integration. ●​ Work with the Full-Stack/DevOps engineer on infrastructure, CI/CD, monitoring, and backend optimization. ●​ Contribute to mentoring, code reviews, documentation, and architectural decisions.​ Key Performance Indicators (KPIs) ●​ API p95 response time < 500ms (excluding network latency) ●​ 99.9% backend uptime ●​ Complete and accurate OpenAPI documentation (>95%) ●​ Successful agent integration and delivery aligned with roadmap ●​ Load-tested scalability to target concurrency thresholds

Required Skills and Experience

●​ 7–10+ years in backend engineering, with 5+ years building scalable, distributed, real-time systems. ●​ Expert in Python, FastAPI, WebSocket development (including TLS/WSS). ●​ Strong experience with GCP (Cloud Run, Firestore, Memorystore, Pub/Sub, Vertex AI). ●​ Proficient in integrating LLM APIs (Gemini preferred); experience with ADK or equivalent frameworks (LangChain, LangGraph). ●​ Hands-on experience building RAG systems (ideal in educational context). ●​ Familiarity with prompt engineering, distributed systems, performance tuning.​

Why Candidate should apply this position

- Work arrangement: Hybrid (office 1-2 days/week) - Full-time labour contract

Preferred skills and experiences

●​ Technical leadership experience in AI or EdTech projects. ●​ GCP certifications (Cloud Developer or Cloud Architect).​

Report to

Client

Interview process

Technical interview and English check -> Client review, assignment (3hrs) -> Interview with client side

Vũ Tú

Headhunter | Recruiter
employee 84 candidates
cup 1 interviews
health 0 offers

Vũ Tú

Headhunter | Recruiter
Icon employee 84 candidates
Icon cup 1 interviews
Icon health 0 offers

Completed jobs (0)
Sign In

I don't have an account