Software Engineer, Data Infrastructure & Acquisition - Phoenix, AZ, USA
RemoteHunter
📍 United States, US0💼 Tempo pieno🕐 23 giorni fa
Candidati ora →
Crea un account gratis in 30 secondi: ottieni anche il match score AI con il tuo CV.
Descrizione
About Our Client
The organization operates in the artificial intelligence and audio technology sector, focusing on developing advanced models supported by large-scale, high-quality datasets. It addresses complex challenges related to data collection and management at a petabyte scale by tightly integrating infrastructure, engineering, and research to efficiently support model training for both consumer and enterprise applications.
About the Opportunity
The Software Engineer, Data Infrastructure & Acquisition is responsible for managing and enhancing the massive data collection processes that fuel foundational AI model training. This role focuses on sourcing and ingesting large volumes of audio data, optimizing complex cloud infrastructure, and collaborating across teams to improve data quality and cost-efficiency. The position plays a key part in shaping the dataset roadmap to advance next-generation AI products.
Responsibilities
Data Sourcing: Proactively identify, evaluate, and acquire new audio data sources for high-throughput ingestion.
Infrastructure Management: Operate, maintain, and develop robust cloud infrastructure for data ingestion pipelines on Google Cloud Platform (GCP) using Terraform.
Pipeline Optimization: Collaborate closely with research scientists to optimize pipeline cost, data throughput, and overall dataset quality.
Strategic Collaboration: Work alongside the core AI team and executive leadership to define, iterate, and execute the company's long-term dataset strategy.
Requirements
Education: BS, MS, or PhD in Computer Science or a closely related quantitative field.
Professional Experience: Over 5 years of professional software development experience.
Technical Scripting: Deep proficiency in bash and Python scripting within Linux environments.
DevOps & Cloud Platforms: Hands-on experience with Docker, Infrastructure-as-Code (IaC) principles, and major cloud platforms (GCP experience is highly preferred).
Data Processing (Plus): Direct knowledge of building web crawlers and handling large-scale data processing systems is considered a strong plus.
Core Competencies: Strong verbal and written communication skills, with a proven ability to manage multiple priorities and adapt quickly to changing project needs.
Compensation and Benefits
Compensation
Base Salary Range: $140,000 to $200,000 per year (United States base salary for this full-time position).
Additional Incentives: Performance-based bonus structures and high-upside company equity packages, scaling with depth of experience.
Equal Opportunity Statement
Our client is an equal opportunity employer. They celebrate diversity and are committed to creating an inclusive environment for all employees. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, or national origin.
About RemoteHunter
RemoteHunter is not the Employer of Record (EOR) for this role. Our purpose in this opportunity is to connect exceptional candidates with leading employers. We help job seekers worldwide discover roles that match their goals and guide them to complete their full application directly through the hiring company’s career page or ATS.
TalentyGo è un aggregatore di offerte da fonti pubbliche. Verifica sempre le informazioni direttamente con l'azienda. La candidatura avviene tramite il sito originale dell'azienda; TalentyGo non gestisce processi di selezione.