Job Summary
Invisible Agency is seeking a Spanish language expert specializing in Andean dialects to train AI systems. This fully remote role focuses on creating, curating, and validating high-quality language data to improve AI models, with emphasis on Peruvian, Bolivian, and Ecuadorian Spanish variants.
Responsibilities
- Create and curate Spanish (Andean) language datasets for AI training (transcripts, prompts, intents, and related data).
- Annotate and label linguistic data for quality and consistency (morphology, syntax, semantics).
- Develop and maintain data collection guidelines and documentation.
- Collaborate with ML/AI engineers to refine data requirements and evaluation metrics.
- Perform quality assurance checks on datasets and flag anomalies.
- Ensure data privacy and compliance with applicable policies.
- Support language-specific evaluation and benchmarking of model performance.
- Communicate progress and blockers with a remote, distributed team.
Requirements
- Native or near-native Spanish speaker with strong knowledge of Andean dialects (Peru, Bolivia, Ecuador).
- Experience in data labeling, linguistic annotation, or NLP data curation.
- Familiarity with data labeling tools and basic NLP concepts.
- Excellent attention to detail, organization, and time-management skills.
- Comfortable working in a remote, asynchronous team across time zones.
- Proficiency in English for remote collaboration and communication.
Nice-to-Have
- Basic Python or scripting skills for data QA.
- Experience with AI training pipelines or prompt design.
About Invisible Agency
Invisible Agency is a global, forward-thinking organization delivering language data and AI solutions. This role offers opportunities to contribute to impactful language data initiatives within a collaborative, international team.
Benefits
- Remote, flexible work arrangement with a distributed team.
- Global collaboration and opportunities for professional growth.
- Competitive compensation aligned with experience.