This is the Linux app named tika-python whose latest release can be downloaded as ApacheTika2.6.0Release.zip. It can be run online in the free hosting provider OnWorks for workstations.
Download and run online this app named tika-python with OnWorks for free.
Follow these instructions in order to run this app:
- 1. Downloaded this application in your PC.
- 2. Enter in our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.
- 3. Upload this application in such filemanager.
- 4. Start the OnWorks Linux online or Windows online emulator or MACOS online emulator from this website.
- 5. From the OnWorks Linux OS you have just started, goto our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.
- 6. Download the application, install it and run it.
SCREENSHOTS
Ad
tika-python
DESCRIPTION
A Python port of the Apache Tika library that makes Tika available using the Tika REST Server. This makes Apache Tika available as a Python library, installable via Setuptools, Pip and easy to install. To use this library, you need to have Java 7+ installed on your system as tika-python starts up the Tika REST server in the background. To get this working in a disconnected environment, download a tika server file (both tika-server.jar and tika-server.jar.md5, which can be found here) and set the TIKA_SERVER_JAR environment variable to TIKA_SERVER_JAR="file:////tika-server.jar" which successfully tells python-tika to "download" this file and move it to /tmp/tika-server.jar and run as a background process. This is the only way to run python-tika without internet access. Without this set, the default is to check the tika version and pull latest every time from Apache.
Features
- Parser Interface (backwards compat prior to REST)
- The parser interface extracts text and metadata using the /rmeta interface
- Optionally, you can pass Tika server URL along with the call what's useful for multi-instance execution
- Specify Output Format To XHTML
- The unpack interface handles both metadata and text extraction in a single call
- Internally returns back a tarball of metadata and text entries that is internally unpacked
Programming Language
Python
Categories
This is an application that can also be fetched from https://sourceforge.net/projects/tika-python.mirror/. It has been hosted in OnWorks in order to be run online in an easiest way from one of our free Operative Systems.