AI Data Engineer at PST.AG

Puesto AI Data Engineer
Publicado 30 Jun 2026
Expirado 30 Jul 2026
Empresa PST.AG
Ubicación Barcelona | ES
Tipo de Contrato Full Time

Descripción del Puesto:

Última información laboral de PST.AG para la posición de AI Data Engineer. If the AI Data Engineer vacante en Barcelona coincide con tus calificaciones, envía tu solicitud o currículum directamente a través del portal actualizado de Jobkos.

Ten en cuenta que aplicar a un trabajo puede no ser siempre fácil, ya que los candidatos deben cumplir con ciertos requisitos establecidos por la empresa. Esperamos que esta oportunidad en PST.AG para la posición de AI Data Engineer se ajuste a tu perfil profesional.

Key Responsibilities Specification-Driven Extraction Engineering: 1. Design and maintain declarative extraction specifications—using Pydantic models, JSON schemas, or domain-specific languages—that describe exactly which fields to capture, their types, and validation rules. 2. Implement pipelines that translate these specifications into executable extraction plans, leveraging both classical (Scrapy, Playwright) and AI-augmented (LLM-based semantic parsing) backends. 3. Build reusable specification libraries for recurring data types (product prices, tariff codes, regulatory texts) to accelerate onboarding of new sources. 4. Design and implement autonomous data extraction agents that can make decisions about source selection, retry logic, and parsing strategies Autonomous & Self-Healing Systems: 1. Deploy self-healing spiders that automatically detect website layout changes and repair themselves using Model Context Protocol (MCP) servers (e.g., Scrapy MCP Server, Playwright MCP). 2. Integrate semantic extraction (Scrapy-LLM, custom LLM pipelines) to eliminate selector brittleness—spiders rely on field descriptions, not fragile XPaths. 3. Hands-on experience building AI agents and orchestration systems. 4. Orchestrate complex, multi-step browsing workflows with agentic frameworks (BMAD/TEA, AutoGPT-like agents) that reason about page state, adapt to anti-bot measures, and correct their own behaviour in real time. Platform Thinking & Reusability: 1. Move beyond one-off scrapers: build a component-based extraction platform where selectors, login handlers, and pagination logic are shared, versioned, and tested. 2. Implement monitoring, alerting, and automatic rollback for failed extraction runs. 3. Champion ethical crawling by design—rate limiting, robots.txt respect, and compliance with GDPR/CCPA are built into the specification layer, not retrofitted. Collaboration & Continuous Innovation: 1. Partner with data scientists and domain experts to refine extraction specifications for complex, unstructured domains (e.g., legal texts, tariff classifications). 2. Evaluate and pilot emerging tools to push automation coverage beyond 90%. 3. Document and evangelise specification-driven best practices across the engineering organisation. Qualification: 1. Bachelor’s degree in Computer Science 2. 3+ years of experience in web scraping or data extraction Required Skills: 1. Proficiency with Python 2. Experience with specification-Driven Extraction 3. Experience with LangChain, LangGraph, LlamaIndex, AutoGen 4. Hands on use of Scrapy LLM, Scrapy MCP Server, or similar systems that decouple field definitions from page structure 5. Familiarity with frameworks that give LLMs browser control (Playwright + MCP, BMAD/TEA) to handle complex, non deterministic crawling tasks. 6. Classical Scraping Fundamentals 7. Data Validation & Storage – Ability to define validation rules within specifications and land clean data into SQL/NoSQL databases or data lake 8. Basic API integration and authentication flows. 9. HTTP, DOM, XPath, CSS. Nice to Haves: 1. Contributions to open-source scraping or AI-automation projects. 2. Contributions to open-source scraping or AI-automation projects. 3. Familiarity with data privacy engineering (GDPR, CCPA) baked into specification design. 4. DevOps light – Docker, CI/CD for testing extraction specifications.

Información de la Vacante:

  • Empresa: PST.AG
  • Puesto: AI Data Engineer
  • Lugar de Trabajo: Barcelona
  • País: ES

Cómo Enviar tu Postulación:

Después de leer y comprender los criterios y requisitos mínimos explicados en la información del trabajo AI Data Engineer at the office Barcelona anterior, completa de inmediato tus archivos de solicitud, como carta de presentación, CV (Hoja de Vida), copia de diploma y otros suplementos. Envía a través del enlace Siguiente Página abajo.

Siguiente Página »

Vacantes Similares

  AI Data Engineer at PST.AG
Publicado: 5 hours ago

Desc: Key Responsibilities Specification-Driven Extraction Engineering: 1. Design and maintain declarative extraction specifications—using Pydantic models, JSON schemas, or domain-specific languages—that de...

Empresa: PST.AG | Ubicación: Barcelona

  Operario/a de cisternas at domestiko.com
Publicado: 7 hours ago

Desc: Se busca operario o operaria de cisternas para trabajar en una empresa del sector químico en Barberà del Vallès. Las tareas incluyen carga y descarga de cisternas con bombas centrífugas y neumáticas,...

Empresa: domestiko.com | Ubicación: Barcelona