Add analysis of schema structure decomposition of field keys and subtypes #12

ivbeg · 2022-08-06T08:00:34Z

Flat table datasets (CSV) files, database tables, and sometimes objects with nested objects ofter include elements that could be grouped.

For example CSV file Zaara_D.csv
includes following fields: title, text, date, place, placeURL, placeLocation, placeType, reviewScore, avgScore

We could find that prefix 'place' is a subtype identifier. It could be decomposed as
place:

And postfix Score identifies value type, whether integer or float.

Most data tables use case change or "_" symbol as dividers. Very rarely is the '-' symbol also used.

Detection of field groups and decomposition of field names could help with:

Add group detection to the final report as field_group property.

The text was updated successfully, but these errors were encountered:

ivbeg added the enhancement New feature or request label Aug 6, 2022

ivbeg self-assigned this Aug 6, 2022

Provide feedback