Python Automation Engineer – Multi-Source Scraping & Data Pipeline Build

Remote Full-time
We are looking for a Python automation engineer to build a fully automated data pipeline that gathers AI company data from multiple sources (APIs + web scraping), deduplicates it intelligently, and outputs clean structured data to Airtable or Notion on a weekly schedule. You must have proven experience building production-grade scrapers, not basic scripts. Required: Strong Python (Scrapy, BeautifulSoup, requests) API integrations (REST, authenticated APIs) Experience automating recurring pipelines (cron jobs, scheduled tasks, etc.) Data cleaning, deduplication logic, CSV/JSON handling Ability to write clean, well-structured code Nice to have (not required): Selenium or Playwright Experience with Airtable/Notion API Experience with LLMs for data enrichment Deliverables: Scrapers for multiple AI-related sources (APIs + websites) Deduplication + merging logic across sources Weekly automated update pipeline Output to Airtable/Notion in structured columns Clear documentation so we can maintain it long-term This project should take 2–3 weeks to build, with optional monthly maintenance. If you’ve built multi-source scrapers before, please apply with examples. Apply tot his job
Apply Now →

Similar Jobs

Senior Marketing Data Engineer

Remote

Data Analyst/Engineer - Salesforce, Stripe, Snowflake & Hex Pipelines - Contract to Hire

Remote

Data Engineer- ETL / ELT - Hybrid / Remote (Columbus)

Remote

Principal Consultant (Data Protection SME)

Remote

Cyber Security Engineer (Data Loss Prevention) - Birmingham

Remote

Staff Product Manager, SaaS Data Protection - Salesforce

Remote

Data Security & Compliance Advisor

Remote

Data Privacy Officer

Remote

Data Protection & Classification Specialist

Remote

Technical Product Manager – Data and Infrastructure

Remote

Urgent Care Nurse Practitioner, Weekend Only

Remote

**Experienced Full Stack Live Chat Agent – Web & Cloud Application Development**

Remote

**Experienced Live Chat Agent – Delivering Exceptional Customer Experiences at blithequark**

Remote

Experienced Full Stack Customer Support Specialist – Remote Work Opportunity with blithequark

Remote

[Remote] Software Engineering Intern – Platform, Integrations & Automation

Remote

Library of Congress 2024 Archives, History and Heritage Advanced (AHHA) Internship Program (Remote Internship) in United States

Remote

**Experienced Customer Care Specialist – Remote Virtual Customer Service Representative**

Remote

Recreation Leader – Amazon Store

Remote

**Experienced Full Stack Customer Support Agent – Live Chat Opportunity with $25-$35/Hour Earnings – blithequark**

Remote

Experienced Remote Data Entry Specialist – Home-Based Opportunity for Detail-Oriented Individuals with Strong Organizational Skills

Remote
← Back