This is the Linux app named dude uncomplicated data extraction whose latest release can be downloaded as EnablePoetryvirtualenv.zip. It can be run online in the free hosting provider OnWorks for workstations.
Download and run online this app named dude uncomplicated data extraction with OnWorks for free.
Follow these instructions in order to run this app:
- 1. Downloaded this application in your PC.
- 2. Enter in our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.
- 3. Upload this application in such filemanager.
- 4. Start the OnWorks Linux online or Windows online emulator or MACOS online emulator from this website.
- 5. From the OnWorks Linux OS you have just started, goto our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.
- 6. Download the application, install it and run it.
SCREENSHOTS:
dude uncomplicated data extraction
DESCRIPTION:
Dude is a very simple framework for writing web scrapers using Python decorators. The design, inspired by Flask, was to easily build a web scraper in just a few lines of code. Dude has an easy-to-learn syntax. Dude is currently in Pre-Alpha. Please expect breaking changes. You can run your scraper from terminal/shell/command-line by supplying URLs, the output filename of your choice and the paths to your python scripts to dude scrape command.
Features
- Minimal web scraper
- The output in data.json should contain the actual URL and the metadata prepended with underscore
- Simple Flask-inspired design - build a scraper with decorators
- Uses Playwright API - run your scraper in Chrome, Firefox and Webkit and leverage Playwright's powerful selector engine supporting CSS, XPath, text, regex, etc.
- Data grouping - group related results
- URL pattern matching - run functions on matched URLs
- Setup function - enable setup steps (clicking dialogs or login)
Programming Language
Python
Categories
This is an application that can also be fetched from https://sourceforge.net/projects/dude-uncomp-data-ext.mirror/. It has been hosted in OnWorks in order to be run online in an easiest way from one of our free Operative Systems.