Skip to content
@apify

apify

We're making the web more programmable.

Apify Banner

Apify is the largest ecosystem where developers build, deploy, and publish data extraction and web automation tools. We call them Actors.

Learn About Apify 🧑‍🎓

  • Find hundreds of ready-made Actors for your web scraping or automation project on Apify Store.
  • Learn everything about web scraping and automation with our free courses that will turn you into an expert scraping developer.
  • Publish your web scrapers as paid Actors on the Apify platform, attract people who need these solutions, and get regular passive income.
  • View our livestreams and video content at the Apify YouTube channel.
  • Learn more through tutorials and thought leadership content about web scraping on Apify Blog and Crawlee Blog.

We are hiring! 🕸️

Check out the open positions at Apify and help us make the web more programmable.

Pinned Loading

  1. crawlee crawlee Public

    Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, an…

    TypeScript 21.5k 1.2k

  2. impit impit Public

    impit | rust library for browser impersonation

    Rust 337 28

  3. crawlee-python crawlee-python Public

    Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…

    Python 8k 614

  4. apify-mcp-server apify-mcp-server Public

    The Apify MCP server enables your AI agents to extract data from social media, search engines, maps, e-commerce sites, or any other website using thousands of ready-made scrapers, crawlers, and aut…

    TypeScript 748 99

  5. mcp-cli mcp-cli Public

    mcpc is a CLI client for MCP. It supports persistent sessions, stdio/HTTP, OAuth 2.1, JSON output for code mode, proxy for AI sandboxes, and much more.

    TypeScript 257 10

  6. proxy-chain proxy-chain Public

    Node.js implementation of a proxy server (think Squid) with support for SSL, HTTP/HTTPS, SOCKS5, authentication, and upstream proxy chaining.

    JavaScript 974 162

Repositories

Showing 10 of 199 repositories
  • crawlee Public

    Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

    apify/crawlee’s past year of commit activity
    TypeScript 21,523 Apache-2.0 1,189 134 (1 issue needs help) 31 Updated Feb 4, 2026
  • apify-sdk-js Public

    Apify SDK monorepo

    apify/apify-sdk-js’s past year of commit activity
    MDX 171 Apache-2.0 61 22 5 Updated Feb 4, 2026
  • apify/apify-dify-integration’s past year of commit activity
    Python 0 0 0 2 Updated Feb 4, 2026
  • impit Public

    impit | rust library for browser impersonation

    apify/impit’s past year of commit activity
    Rust 337 Apache-2.0 28 9 3 Updated Feb 4, 2026
  • apify-docs Public

    This project is the home of Apify's documentation.

    apify/apify-docs’s past year of commit activity
    JavaScript 62 Apache-2.0 168 117 (3 issues need help) 25 Updated Feb 4, 2026
  • apify-mcp-server Public

    The Apify MCP server enables your AI agents to extract data from social media, search engines, maps, e-commerce sites, or any other website using thousands of ready-made scrapers, crawlers, and automation tools available on the Apify Store.

    apify/apify-mcp-server’s past year of commit activity
    TypeScript 748 MIT 99 20 7 Updated Feb 4, 2026
  • apify-sdk-python Public

    The Apify SDK for Python is the official library for creating Apify Actors in Python. It provides useful features like actor lifecycle management, local storage emulation, and actor event handling.

    apify/apify-sdk-python’s past year of commit activity
    Python 161 Apache-2.0 21 19 4 Updated Feb 4, 2026
  • apify-client-js Public

    Apify API client for JavaScript / Node.js.

    apify/apify-client-js’s past year of commit activity
    TypeScript 82 Apache-2.0 44 26 5 Updated Feb 4, 2026
  • apify-client-python Public

    Apify API client for Python

    apify/apify-client-python’s past year of commit activity
    Python 89 Apache-2.0 15 12 6 Updated Feb 4, 2026
  • crawlee-python Public

    Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

    apify/crawlee-python’s past year of commit activity
    Python 8,017 Apache-2.0 614 70 4 Updated Feb 4, 2026