Python Engineer to Architect High-Volume Data Pipeline (Social Engagement Data)

Remote Full-time
We are a data agency looking to replace an expensive legacy vendor with an in-house solution. We need a Senior Python Developer to build a high-efficiency data pipeline that aggregates public engagement data (Likes/Comments) from professional social networks. The Goal: Build a "Glass Box" scraper that runs on our cloud infrastructure. We want full ownership of the code and direct billing for the underlying resources (Proxies/APIs). The Specs (Must Have): - Volume: Capability to process 200,000 - 300,000 lookups per week. - Inputs: We provide post URLs or Keywords. - Outputs: CSV/JSON with User Name, Headline, and Profile URL. Cost Constraint: The system must operate (infrastructure wise) for under $1,200/month at full volume. The Architecture: We believe the best approach is a Python script leveraging enterprise APIs to handle the heavy lifting (e.g., Apify, Scrapingdog, or Bright Data). We do not want a Selenium bot running on a laptop. We want a cloud-deployed script (AWS Lambda/DigitalOcean) that manages rotation and rate limits via these APIs. Requirements: Deep experience with Apify Actors or Scrapingdog. Experience with Residential Proxies (configuring bandwidth to minimize waste). Ability to parse large JSON datasets efficiently. Ownership: You build it, we own the code. To Apply: Please tell me which API or Proxy provider you would recommend to hit a volume of 300k/week while keeping ongoing tech costs under $1,200/month. Apply tot his job
Apply Now

Similar Opportunities

Data Modeler remote

Remote Full-time

Sr Data Modeler

Remote Full-time

Data Modeler (Only local to Lincoln, NE consultants)

Remote Full-time

[Hiring] Senior Healthcare Data Modeler @Abacus Insights

Remote Full-time

DATA ENGINEER (DATA MODELING) | COLUMBUS, OH (REMOTE)

Remote Full-time

Remote Data Modeling

Remote Full-time

Data Modeler banking industry Columbia, SC aremote

Remote Full-time

Principal Data Modeler and Database Engineer (Onsite)

Remote Full-time

Senior Data Modeler Leader (Data Warehousing & Governance)

Remote Full-time

Experienced Full Stack Data Product Manager – Data Modeling Focus for arenaflex

Remote Full-time

Remote Case Manager - Stearns County

Remote Full-time

Experienced Remote Data Entry and Customer Service Representative – Part-Time Work from Home Opportunity with Flexible Hours

Remote Full-time

Dispatcher l (3rd Shift)

Remote Full-time

**Experienced Remote Data Entry Specialist – Flexible National & Local Paid Focus Groups, Clinical Trials, and Phone Interviews**

Remote Full-time

**Experienced Remote Data Entry Clerk - CVS Health: Launch Your Career with Comprehensive Training and Flexible Work Arrangements**

Remote Full-time

Experienced Chat Support Agent for Innovative Gig Staffing Platform – Remote Opportunity with Competitive Hourly Rate

Remote Full-time

Remote Policy Advisor

Remote Full-time

Experienced Bilingual Spanish and English Licensed Property and Casualty Customer Service Representative for Remote Work Opportunity at blithequark

Remote Full-time

Research Scientist - Health Care Policy Research

Remote Full-time

Manager, Head of Risk Standards

Remote Full-time
← Back to Home