AI Engineer - Video Analysis Core

AI/Artificial Intelligence, AI/ML, Computer Vision

Salary
Up to $1,900
Location
Ho Chi Minh

Benefits

- Full social insurance
- Annual salary review
- Company trips
- Work laptop/desktop
- Performance bonus
- Additional health insurance
- 13th-month salary
- Other benefits

Job Overview And Responsibility

Established in Vietnam for over 20 years, with more than 350 members, GIANTY is proud to provide numerous successful services to millions of users worldwide.

Our core services:
- Development Services (Outsource & Offshore)
- In-House Development (Apps, Games, Anime, Arts, Design, Big Data/Analytics, IoT, VR, AI)

Your Mission

You will be the founding AI engineer who architects and builds the core video analysis engine. The system will combine multi-modal perception (video, audio, text), LLMs, and multi-agent reasoning to understand human behavior, actions, and outcomes from raw video streams. You will work closely with product leaders and domain experts to turn ideas into working features, ensuring our platform adapts across domains while staying fast, accurate, and scalable.

- Design and implement the AI pipeline for video understanding, including:
  - Frame extraction; object and behavior detection
  - Temporal event segmentation and context tagging
  - Multimodal fusion (vision, audio, transcripts, sensor data)
- Integrate LLMs, RAG, and multi-agent orchestration to interpret, summarize, and reason over video events.
- Build real-time or near-real-time inference workflows with a focus on performance and reliability.
- Own end-to-end delivery, from research prototypes to production-ready APIs and services.
- Collaborate with frontend engineers and PMs to define features and deliver “show, not tell” demos weekly.
- Optimize models for both edge devices and cloud deployment.
- Keep the system domain-agnostic, ensuring adaptability across Education, Retail, Operations, Security, and other verticals.

Required Skills and Experience

- Builder mindset: You turn abstract ideas into fast, tangible results.
- Hands-on problem solver: You don’t wait for perfect specs; you ship MVPs, iterate quickly, and improve continuously.
- Strong AI/ML foundations with at least 3 years of proven experience in:
  - Video analytics & computer vision (YOLO, OpenCV, Mediapipe, etc.)
  - Multimodal models & LLM integration (OpenAI, Gemini, or similar)
  - Retrieval-Augmented Generation (RAG) pipelines
  - Multi-agent frameworks (LangChain, CrewAI, AutoGen, etc.)
- Technical skills: Python, PyTorch/TensorFlow, API development, and cloud/edge deployment (AWS, GCP, Jetson, etc.).
- Data technologies: Vector databases, embeddings, and scalable data pipelines.

Why Candidates Should Apply for This Position

- A core role shaping a multi-domain AI product platform from zero to scale.
- Opportunity to own the architecture and roadmap of a critical system.
- A builder culture: fast iterations, weekly user demos, “done > perfect”.
- Exposure to multi-domain problems (Education, Retail, Industrial).
- Competitive salary plus stock options for early team members.
- Performance review (end of year) and salary review (in June) every year.
- SHUI and health insurance.
- Working hours: 8:00-17:00, Monday to Friday.

Preferred skills and experiences

- Startup mindset: Comfortable working in a fast-paced, ambiguous environment with direct interaction with founders and customers.

Report to

CTO

Interview process

1 in-person round at the office

Peter Lim

Headhunter | Recruiter
Verified
0 résumés
0 interviews
0 offers
