Projects / Talend Open Profiler

Talend Open Profiler

Talend Open Profiler (TOP) helps you to profile your data. TOP's ergonomic interface allows you to define metrics (indicators) and collect statistics on your data in a few clicks. It comes with a set of regular expressions that helps you to identify bad data. You can create your own regular expressions and use them in data profiling analyses. A lot of options exist for each indicator, which change the behavior of the indicator so that it gives you more pertinent information. Data quality options on indicators alert you when your data quality is not what you expected.

Tags Information Management Metadata/Semantic Models Database Front-Ends Software Development Quality Assurance Records Management
Licenses GPLv2
Operating Systems OS Independent
Implementation SQL Java
Translations English

Tweet this project Short link

Rss Recent releases

  • Rrelease-mid
  •  07 Jul 2009 15:49
  • Rrelease-after

Changes: Indicator's definitions can be edited. The connection of an analysis can be changed. Database support for SQLite3, AS400 was added. A data filter section was added in the redundancy analysis. Empty and null records are highlighted in all frequency tables. Folders are reordered in DQ Repository view. Java computation of column indicators is done. A preference page for the folding policy of the editor sections was added. A preference page for setting for number of analyzed items per page was added.

  • Rrelease-mid
  •  01 Jul 2009 13:35
  • Rrelease-after

    Changes: Some bugs were fixed, including a bug in reloading the column list.

    • Rrelease-mid
    •  22 Jun 2009 11:42
    • Rrelease-after

      Changes: Some bugs were fixed.

      • Rrelease-mid
      •  19 May 2009 11:55
      • Rrelease-after

        Changes: The SQL editor can now be saved. Tables and columns can be compared with the database structure. Documentation was added for the new indicators. Previewing data no longer switches to another perspective. A warning was added for costly analyses. Some SQL syntax errors were fixed.

        • Rrelease-mid
        •  06 May 2009 15:12
        • Rrelease-after

          Changes: New analyses include overview analyses (on connections, catalogs, and schemas), three types of correlation analyses (nominal, time, and numerical), and analyses based on custom DQ rules written as SQL rules. New indicators include soundex frequencies, pattern frequencies, and a count of default values. The ability to drill down into data was added in all analyses. Potential quality issues are highlighted in the results. Regular expressions (patterns) can be imported and exported with the Talend Exchange Web site. Internationalization of the platform was done via the Talend Babili Web site.

          No-screenshot

          Project Spotlight

          Fakeroot Next Gen

          Software that fools a program into thinking it is running as root.

          4b07879d5a5e6363290a5602f791696b_thumb

          Project Spotlight

          DMDirc

          An IRC client.