tika-python download for Linux

This is the Linux app named tika-python whose latest release can be downloaded as ApacheTika2.6.0Release.zip. It can be run online in the free hosting provider OnWorks for workstations.

 
 

Download and run online this app named tika-python with OnWorks for free.

Follow these instructions in order to run this app:

- 1. Downloaded this application in your PC.

- 2. Enter in our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.

- 3. Upload this application in such filemanager.

- 4. Start the OnWorks Linux online or Windows online emulator or MACOS online emulator from this website.

- 5. From the OnWorks Linux OS you have just started, goto our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.

- 6. Download the application, install it and run it.

SCREENSHOTS:


tika-python


DESCRIPTION:

A Python port of the Apache Tika library that makes Tika available using the Tika REST Server. This makes Apache Tika available as a Python library, installable via Setuptools, Pip and easy to install. To use this library, you need to have Java 7+ installed on your system as tika-python starts up the Tika REST server in the background. To get this working in a disconnected environment, download a tika server file (both tika-server.jar and tika-server.jar.md5, which can be found here) and set the TIKA_SERVER_JAR environment variable to TIKA_SERVER_JAR="file:////tika-server.jar" which successfully tells python-tika to "download" this file and move it to /tmp/tika-server.jar and run as a background process. This is the only way to run python-tika without internet access. Without this set, the default is to check the tika version and pull latest every time from Apache.



Features

  • Parser Interface (backwards compat prior to REST)
  • The parser interface extracts text and metadata using the /rmeta interface
  • Optionally, you can pass Tika server URL along with the call what's useful for multi-instance execution
  • Specify Output Format To XHTML
  • The unpack interface handles both metadata and text extraction in a single call
  • Internally returns back a tarball of metadata and text entries that is internally unpacked


Programming Language

Python


Categories

Text Processing, Healthcare, Machine Learning

This is an application that can also be fetched from https://sourceforge.net/projects/tika-python.mirror/. It has been hosted in OnWorks in order to be run online in an easiest way from one of our free Operative Systems.



Latest Linux & Windows online programs


Categories to download Software & Programs for Windows & Linux