
SR. CLOUD INFRASTRUCTURE ENGINEER (AI & LLM PLATFORMS)

Q6 Cyber

📍 United States, US · 💼 Full-time · 🕐 Posted 13 days ago


Description

Job Description

We are seeking a specialized Infrastructure Engineer to bridge the gap between our large data repositories, Cloud Platform and the rapidly evolving world of Large Language Models (LLMs). You will be responsible for building the "plumbing" that allows our internal teams and external users to leverage AI effectively. This includes deploying Model Context Protocol (MCP) servers, building agentic execution environments, and scaling our internal Retrieval-Augmented Generation (RAG) architecture.

Key Responsibilities

- AI Architecture Guidance: Guide the architecture that will allow us to leverage AI tools with our large existing data stores and incoming streams of realtime intelligence.
- Cross-Team Integration: Work closely with other infrastructure engineers and software development teams to integrate AI tools into existing systems.
- MCP Ecosystem Management: Design, deploy, and maintain Model Context Protocol (MCP) servers to allow LLMs to securely interact with our internal databases, APIs, and external tooling.
- Agentic Infrastructure: Build and orchestrate sandboxed, scalable environments (e.g., using Docker or specialized runtimes) where users can safely build and execute AI agents.
- Internal RAG Platform: Develop and manage the infrastructure for our internal RAG (Retrieval-Augmented Generation) pipeline, including vector database management (e.g., Pinecone, Weaviate, or pgvector) and automated embedding pipelines.
- Deployment & Scaling: Utilize Kubernetes (K8s) and Infrastructure as Code (Terraform/Pulumi) to deploy LLM-related tools, ensuring high availability and low latency for model inference and data retrieval.
- Security & Governance: Implement strict guardrails for data privacy within LLM workflows, ensuring internal datasets remain secure while being accessible to authorized AI tools.

Required Qualifications

- 5+ years of experience in DevOps, Platform Engineering, or SRE, with at least 1-2 years specifically focused on AI/ML infrastructure.
- Proven track record of building production-grade RAG pipelines or LLM-integrated applications.
- Thrives in "day zero" environments where the tools and protocols (like MCP) are evolving weekly.
- Deep understanding of the security implications of LLMs (prompt injection, data leakage, and secure tool execution).
- Experience working with substantial datasets (over 1bn objects, dozens or hundreds of TBs) and the challenges of leveraging AI tools with these data sets.
- Bachelor's degree or equivalent in computer science or related field.

Required Technical Skills

- Cloud & Orchestration: AWS/GCP/Azure, Kubernetes, Terraform, Helm.
- AI Frameworks: LangChain, LlamaIndex, LangGraph.
- Data & Vectors: Pinecone, Milvus, Qdrant, or pgvector; Apache Kafka/Pulsar; Elasticsearch/OpenSearch; traditional SQL RDBMS.
- Languages: Python (Expert), TypeScript/Node.js (for MCP development), Go.
- AI Protocols: Model Context Protocol (MCP), REST/gRPC.

TalentyGo is an aggregator of job postings from public sources. Always verify the information directly with the company. Applications are submitted through the company's original website; TalentyGo does not manage selection processes.