Skip to content
Available for new projects

Build data systems that actually think

Senior Data Engineer · AI-Powered Data Applications · Cloud

  • Are your data pipelines slow, fragile, or impossible to maintain?

  • Do you need cloud infrastructure that scales without constant firefighting?

  • Want to integrate LLMs into your data workflows — without the hype?

  • Looking for an engineer who understands both the data stack and the business?

  • Need production-grade delivery, not just prototypes?

Book an Intro Call

5+ years of experience
$750K in costs reduced
2,000+ hours automated
5 cloud certifications

Portrait of Andres Avila, Senior Data Engineer

About me

Hi, I'm Andres — a Senior Data Engineer based in México, with 5+ years designing and delivering high-impact cloud data systems for enterprise companies including Hershey's, EY/Microsoft, and Wizeline.

I specialize in AI-powered data systems: combining robust cloud pipeline engineering (Azure, AWS, Snowflake, dbt) with LLM integration to help teams move from raw data to operational intelligence. My work spans the full data lifecycle — from architecture and ingestion to transformation, orchestration, and business-facing analytics.

I'm currently open to Backend Contracts — long-term engagements with teams that need a senior data engineer without the overhead of a full-time hire.

What I can help you with

  • Cloud Data Engineering


    I design and build production-grade pipelines on Azure, AWS, and Snowflake. From raw ingestion to semantic modeling with dbt — clean, observable, maintainable infrastructure that scales with your business.

  • LLM Integration


    I connect your structured data to language models to automate analysis, classification, and reporting. Real production experience — not just demo projects. See biopanel.io for a live example.

  • Analytics Engineering


    Semantic layers, metric definitions, and dashboards your entire team can trust. I've built reporting infrastructure used by analytics and data science teams at Fortune 500 companies.

  • Data Migration


    I've led client-specific migration workflows with full validation pipelines: extraction, transformation, profiling, defect detection, and deployment — with documentation your team can maintain.

Why work with me?

  • Certified across all major clouds


    AWS Data Engineer Associate · Databricks Certified Data Engineer Professional · Snowflake SnowPro Core · Azure Data Engineer Associate (DP-203) · Fabric Data Engineer Associate (DP-700)

  • Quantified, enterprise-scale impact


    I don't just build pipelines — I reduce $750K in vendor costs, cut 2,000+ hours of manual work, and optimize runtimes by 67%. Every engagement I've had has a measurable business outcome.

  • Full-stack when it matters


    I built biopanel.io end-to-end: FastAPI backend, Celery + Redis async pipeline, PostgreSQL, React frontend. I can talk to your product and engineering teams in the same language.

  • Async-first, enterprise-grade delivery


    I've worked remotely across EY, Microsoft, Hershey's, and Wizeline. Clear communication, weekly updates, detailed documentation — you always know where things stand.

Tech stack

Cloud · Azure (ADF, Synapse, Fabric, Functions) · AWS (S3, Glue, Redshift, Lambda) · Snowflake · GCP basics

Orchestration & Transformation · Apache Airflow · dbt · PySpark · Azure Data Factory

AI / LLM · OpenAI API · LangChain · Pinecone · Retrieval-Augmented Generation · Streamlit

Languages · Python · SQL · DAX · Shell/Bash · YAML · TypeScript (basics)

DevOps · Azure DevOps · GitHub Actions · Docker · Terraform · CI/CD pipelines

Frequently asked questions

What kind of contracts are you available for?

I'm primarily open to Backend Contracts (BECs) — long-term staff augmentation engagements (typically 20–32 hrs/week) where I embed as a senior data engineer in your team. I'm also open to well-scoped project-based contracts for data migrations, pipeline buildouts, or LLM integration work. Minimum engagement: 20 hours.

What industries have you worked in?

Consumer goods (Hershey's), professional services and enterprise software (EY/Microsoft), technology (Wizeline), and health tech (biopanel.io — personal project). I'm most effective with enterprise teams that have complex data at scale — fintechs, SaaS platforms, and healthcare are strong fits.

How do you handle confidentiality?

I sign NDAs before any project begins. All credentials and sensitive data are managed with enterprise-grade security practices — never hardcoded, always encrypted at rest and in transit. I can work within your existing security policies.

What's your typical engagement structure?

I prefer a short paid discovery phase (5–10 hours) to fully understand your architecture and data before committing to a full engagement. This eliminates surprises on both sides. From there, I work in weekly cycles with async updates and a standing check-in.

Where are you based? Do you work with international clients?

I'm based in Aguascalientes, México (UTC-6). I work remote-first and have experience with distributed teams across US, EU, and LATAM timezones. Fluent in English and Spanish.

What are your rates?

I price based on scope and value delivered, not hours logged. Let's talk during our intro call — I'll be direct about what makes sense for your situation.

  • Let's see if we're a fit


    I take on a limited number of clients at a time to ensure quality. Book a free 30-minute call to discuss your data challenges and what an engagement could look like.

    Book Intro Call