This is the Linux app named DataCleaner to run in Linux online whose latest release can be downloaded as DataCleaner-all.zip. It can be run online in the free hosting provider OnWorks for workstations.
Download and run online this app named DataCleaner to run in Linux online with OnWorks for free.
Follow these instructions in order to run this app:
- 1. Downloaded this application in your PC.
- 2. Enter in our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.
- 3. Upload this application in such filemanager.
- 4. Start the OnWorks Linux online or Windows online emulator or MACOS online emulator from this website.
- 5. From the OnWorks Linux OS you have just started, goto our file manager https://www.onworks.net/myfiles.php?username=XXXXX with the username that you want.
- 6. Download the application, install it and run it.
SCREENSHOTS
Ad
DataCleaner to run in Linux online
DESCRIPTION
DataCleaner is a data quality analysis application and a solution platform for DQ solutions. It's core is a strong data profiling engine, which is extensible and thereby adds data cleansing, transformations, enrichment, deduplication, matching and merging.Website: http://datacleaner.github.io
Features
- Profiles and analyzes your database within minutes!
- Access almost any datastore - Oracle, MySQL, PostgreSQL, MS SQL Server, MongoDB, CUBRID, CSV files, Excel spreadsheets, dbase and more
- Discover patterns in your textual data with the Pattern Finder
- Find out which values occur the most with the Value Distribution profile
- Cleanse your contact details with name and address validations
- Detect duplicates using fuzzy logic and configurable weights and thresholds
- Merge your duplicates and create a single version of the truth
- Write data back to relational databases, CSV files, Excel spreadsheets or MongoDB databases
Audience
Information Technology, Science/Research, Quality Engineers
User interface
Java Swing, Web-based
Programming Language
Java
Database Environment
Project is a database management tool, Project is a database conversion tool, XML-based, HSQL, JDBC, Oracle, MySQL, PostgreSQL (pgsql), SQLite, Other network-based DBMS, Firebird/InterBase, Microsoft SQL Server, Flat-file
Partners
Human Inference is the European market leader in data quality solutions. The solutions are based on natural language processing and contain a core of knowledge to provide our customers with the best quality possible.
Neopost Customer Information ManagementNeopost Customer Information Management is a set of solutions and services that covers the entire lifecycle of customer information and communication management.
This is an application that can also be fetched from https://sourceforge.net/projects/datacleaner/. It has been hosted in OnWorks in order to be run online in an easiest way from one of our free Operative Systems.