This is the Linux app named crawlee, whose latest release can be downloaded as v3.5.8sourcecode.zip. It can be run online through OnWorks, a free hosting provider for workstations.
Download crawlee and run it online with OnWorks for free.
Follow these instructions to run this app:
- 1. Download this application to your PC.
- 2. Open our file manager at https://www.onworks.net/myfiles.php?username=XXXXX, using the username that you want.
- 3. Upload this application to that file manager.
- 4. Start the OnWorks Linux online, Windows online, or macOS online emulator from this website.
- 5. From the OnWorks Linux OS you have just started, go to our file manager at https://www.onworks.net/myfiles.php?username=XXXXX, using the username that you want.
- 6. Download the application, install it, and run it.
DESCRIPTION
Crawlee is a web scraping and browser automation library. It helps you build reliable crawlers. Fast. Crawlee won't fix broken selectors for you (yet), but it helps you build and maintain your crawlers faster. When a website adds JavaScript rendering, you don't have to rewrite everything, only switch to one of the browser crawlers. When you later find a great API to speed up your crawls, flip the switch back. It keeps your proxies healthy by rotating them smartly with good fingerprints that make your crawlers look human-like. It's not unblockable, but it will save you money in the long run. Crawlee is built by people who scrape for a living and use it every day to scrape millions of pages. Meet our community on Discord. We believe websites are best scraped in the language they're written in. Crawlee runs on Node.js and it's built in TypeScript to improve code completion in your IDE, even if you don't use TypeScript yourself.
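The proxy-rotation idea mentioned above can be illustrated with a small, dependency-free sketch. This is not Crawlee's implementation or API: the `ProxyRotator` class, its `maxErrors` threshold, and the `proxy-*.example` URLs are hypothetical names chosen for illustration, and the sketch does nothing about fingerprints. It only shows one simple way to rotate through proxies while temporarily retiring those that keep failing.

```typescript
// Hypothetical sketch: NOT Crawlee's implementation or API.
// Round-robin proxy rotation that skips proxies after repeated failures.

interface ProxyState {
    url: string;
    errorCount: number;
}

class ProxyRotator {
    private readonly proxies: ProxyState[];
    private readonly maxErrors: number;
    private cursor = 0;

    // maxErrors is an assumed threshold: a proxy is skipped once it has
    // accumulated this many errors without an intervening success.
    constructor(urls: string[], maxErrors = 3) {
        if (urls.length === 0) {
            throw new Error('At least one proxy URL is required');
        }
        this.proxies = urls.map((url) => ({ url, errorCount: 0 }));
        this.maxErrors = maxErrors;
    }

    // Returns the next healthy proxy URL, or undefined if every proxy is retired.
    next(): string | undefined {
        for (let i = 0; i < this.proxies.length; i++) {
            const candidate = this.proxies[this.cursor];
            this.cursor = (this.cursor + 1) % this.proxies.length;
            if (candidate.errorCount < this.maxErrors) {
                return candidate.url;
            }
        }
        return undefined;
    }

    // Record a failed request (for example, a block page or a timeout).
    markBad(url: string): void {
        const proxy = this.proxies.find((p) => p.url === url);
        if (proxy) proxy.errorCount += 1;
    }

    // Record a successful request, which resets the proxy's error count.
    markGood(url: string): void {
        const proxy = this.proxies.find((p) => p.url === url);
        if (proxy) proxy.errorCount = 0;
    }
}

// Usage: pick a proxy for each request and report the outcome back.
const rotator = new ProxyRotator(
    ['http://proxy-a.example:8000', 'http://proxy-b.example:8000'],
    2,
);
const chosen = rotator.next();
if (chosen !== undefined) {
    console.log(`Next request would go through ${chosen}`);
    rotator.markGood(chosen);
}
```

In a real crawler the `markBad` and `markGood` calls would be driven by response status codes or blocking signals. Crawlee ships its own proxy and session management, so its documentation is the place to look for the actual API.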
Features
- JavaScript & TypeScript
- HTTP scraping
- Headless browsers
- Automatic scaling and proxy management
- Queue and Storage
- Helpful utils and configurability
Programming Language
TypeScript
This application can also be fetched from https://sourceforge.net/projects/crawlee.mirror/. It is hosted on OnWorks so that it can be run online more easily from one of our free operating systems.