Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature/support cosmos #145

Draft
wants to merge 59 commits into
base: main
Choose a base branch
from
Draft

Feature/support cosmos #145

wants to merge 59 commits into from

Commits on Dec 12, 2022

  1. Configuration menu
    Copy the full SHA
    48394f1 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    b6b48a3 View commit details
    Browse the repository at this point in the history

Commits on Dec 18, 2022

  1. Configuration menu
    Copy the full SHA
    fa5c9c4 View commit details
    Browse the repository at this point in the history
  2. Testing Overhaul

    * Adding tests/environment folder to store datasets and bicep templates for test sources
    * Added scripts to create databricks jobs and a notebook to mount storage on Databricks
    * Making test environments more consistent across notebooks (secret scope, environment variables)
    * Handle of tests were modified to correct mistakes not caught in source controlled versions
    * Added documentation for testing environment including what secrets are used and what they look like
    * Adding requirements.txt file for environment deployment
    * Hive tests should run without additional intervention (i.e. use CREATE IF NOT EXISTS)
    * Removing production env deployment
    * Remove the wasbs with parameters test
    * After updating all jobdefs to be ready for upload, the run-tests script needed to look at .name instead of .settings.name
      * Unfortunately, when calling the jobs API, it returns a .settings.name which must be used
    wjohnson committed Dec 18, 2022
    Configuration menu
    Copy the full SHA
    9c71e45 View commit details
    Browse the repository at this point in the history

Commits on Jan 6, 2023

  1. Resolving race condition based on app settings being deployed while m…

    …s deploy is going
    
    and event hub authorization rules deploying at the same time.
    
    Co-authored-by: hmoazam <[email protected]>
    wjohnson and hmoazam committed Jan 6, 2023
    Configuration menu
    Copy the full SHA
    73e656e View commit details
    Browse the repository at this point in the history

Commits on Feb 12, 2023

  1. Bump cryptography from 38.0.4 to 39.0.1 in /tests/environment

    Bumps [cryptography](https://github.com/pyca/cryptography) from 38.0.4 to 39.0.1.
    - [Release notes](https://github.com/pyca/cryptography/releases)
    - [Changelog](https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst)
    - [Commits](pyca/cryptography@38.0.4...39.0.1)
    
    ---
    updated-dependencies:
    - dependency-name: cryptography
      dependency-type: direct:production
    ...
    
    Signed-off-by: dependabot[bot] <[email protected]>
    dependabot[bot] authored and wjohnson committed Feb 12, 2023
    Configuration menu
    Copy the full SHA
    612f4c0 View commit details
    Browse the repository at this point in the history

Commits on Apr 28, 2023

  1. Added unit test and integration test for Azure MySQL, and updated LIM…

    …ITATIONS.md to indicate MySQL support (#149)
    hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    b74ad2b View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    6f42108 View commit details
    Browse the repository at this point in the history
  3. No longer supporting Spark 2

    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    e83d8ac View commit details
    Browse the repository at this point in the history
  4. Updates - Postgres (#148)

    * Adding bicep file
    
    * Added unit and integration tests for Postgres. Updated LIMITATIONS.md to describe support and limitations
    hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    7099edd View commit details
    Browse the repository at this point in the history
  5. Feature/support kusto (#147)

    * Implemented support for kusto by updating mappings.
    
    * Implemented Kusto support and added unit and integration tests
    hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    6296607 View commit details
    Browse the repository at this point in the history
  6. Refactoring SelectReturnEntity to be more clear

    * checking for resource set types as its own method
    * checking for blob or data lake file path or filesystem types as its own method
    * refactored checking for adf relationships for blob/data lake file path types to its own method
    * Behavior Change for blob/data lake file path types that match: follow the insert into first position of valid entities rather than return a list with only the entity
    * Renamed ResourceSet_QualifiedNames_Match to QualifiedNames_Match_After_Normalizing and added the trim of the trailing slash for comaprisons
        * Using QualifiedNames_Match_After_Normalizing when comparing as the dfs vs blob comparison was preventing folder paths from matching if they're using wasbs but scanned for azure data lake gen2 (abfss)
    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    f59a347 View commit details
    Browse the repository at this point in the history
  7. Refactoring SelectReturnEntity to reflect we only accept entities wit…

    …h matching fully qualified names
    
    By pulling out and putting first the QualifiedNames_Match_After_Normalizing, it's clear that all search result entities must have a matching FQN.
    
    This puts a stronger emphasis on the Build_Searchable_QualifiedName and its ability to standardize the name.
    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    dbccd6a View commit details
    Browse the repository at this point in the history
  8. Fix spark3-test-def merge conflict

    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    2636d40 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    45f65db View commit details
    Browse the repository at this point in the history
  10. Refactoring validentity to validEntitiesAfterFiltering to make it mor…

    …e clear that filtering has occured
    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    79410ce View commit details
    Browse the repository at this point in the history
  11. Removing unncessary column mapping comments and Validate_Resource_Set…

    … method which is never referenced
    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    61ea232 View commit details
    Browse the repository at this point in the history
  12. Refactoring PurviewIngestion to remove unused methods and refactor ou…

    …r the Log method in favor of using the ILogger directly
    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    d53f45f View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    a947969 View commit details
    Browse the repository at this point in the history
  14. Refactoring SendToPurview in PurviewIngestion for each loop's variabl…

    …e from a simple entity to purviewEntityToBeUpdated to make it more clear what's going to happen to this entity and not confuse it with other references to the word 'entity'
    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    f061826 View commit details
    Browse the repository at this point in the history
  15. Refactoring PurviewIngestion naming conventions to clarify entities t…

    …hat may be deleted and adding comments to walk through relationship attribute extraction and replacement
    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    d3dd7af View commit details
    Browse the repository at this point in the history
  16. Adding a field for originalQualifiedName and removing unused methods …

    …simpleEntity, AddToTable, FindQualifiedNameInPurview, CleanUnusedCustomEntities, and Name_To_Search
    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    d732ef7 View commit details
    Browse the repository at this point in the history
  17. Configuration menu
    Copy the full SHA
    48dc0c6 View commit details
    Browse the repository at this point in the history
  18. ColParser should optionally take in a mapping of original dataset nam…

    …es to fully qualified names discovered during search and PurviewIngestion needs to keep track of these mappings
    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    50b0bf5 View commit details
    Browse the repository at this point in the history
  19. Refactoring to support extracting the ColumnParser to be passed aroun…

    …d in PurviewOut
    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    48af630 View commit details
    Browse the repository at this point in the history
  20. Configuration menu
    Copy the full SHA
    ada23c9 View commit details
    Browse the repository at this point in the history
  21. Updating Limitations and Readme to better reflect current state and s…

    …upport of column lineage
    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    0c68b4a View commit details
    Browse the repository at this point in the history
  22. Handle Azure Data Factory Job Names (#137)

    Truncate Azure Data Factory job name guid to prevent creating duplicate jobs / tasks only differntiated by a guid / pipeline id
    Job name should be ADF_factoryName_pipelineName
    Task name should be ADF_factoryName_pipelineName_activityName
    ADF Regex pattern should not ignore case
    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    eddfa25 View commit details
    Browse the repository at this point in the history
  23. Configuration menu
    Copy the full SHA
    3fd940d View commit details
    Browse the repository at this point in the history
  24. Update ADF and Kusto limitations (#169)

    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    f4b166b View commit details
    Browse the repository at this point in the history
  25. Update Delta Merge support (#167)

    Closes #156
    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    811a7ae View commit details
    Browse the repository at this point in the history
  26. Configuration menu
    Copy the full SHA
    014183b View commit details
    Browse the repository at this point in the history
  27. OL 13 -> 18 (#173)

    hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    fbabc92 View commit details
    Browse the repository at this point in the history
  28. Configuration menu
    Copy the full SHA
    5e65747 View commit details
    Browse the repository at this point in the history
  29. Configuration menu
    Copy the full SHA
    39dcd36 View commit details
    Browse the repository at this point in the history
  30. Configuration menu
    Copy the full SHA
    2c58203 View commit details
    Browse the repository at this point in the history
  31. Fix Library Definitions in Job Tasks to prevent deserialization error…

    … from JSON to class models
    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    b6b1e87 View commit details
    Browse the repository at this point in the history
  32. OlToPurviewMapping Quality of Dev Improvements

    Python script to remove spaces and added as an artifact to Build and Release (Closes #183)
    Python script to update the arm template with the OlToPurviewMapping in a stringified json format (Closes #184)
    
    Updated the arm template with the standardized response from the new python script to avoid conflicts later on
    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    bc40c97 View commit details
    Browse the repository at this point in the history
  33. Configuration menu
    Copy the full SHA
    46ae5b9 View commit details
    Browse the repository at this point in the history
  34. Correct one line mappings as artifact

    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    05a54b2 View commit details
    Browse the repository at this point in the history
  35. Fix Mappings for Mount Points with Subdirectories

    Mount points that have a source with a subdirectory were failing to include the subdirectories
    
    Introduced a new namespaceBodyPartsJoinedBySlashFrom mappings variable to capture any parts from
    a certain position and concatenate them with a forward slash.
    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    6fd492e View commit details
    Browse the repository at this point in the history
  36. Configuration menu
    Copy the full SHA
    7502ed5 View commit details
    Browse the repository at this point in the history
  37. Testin

    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    c0f65d1 View commit details
    Browse the repository at this point in the history
  38. Configuration menu
    Copy the full SHA
    c9f0189 View commit details
    Browse the repository at this point in the history
  39. Remove unncessary comment

    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    51678a0 View commit details
    Browse the repository at this point in the history
  40. Configuration menu
    Copy the full SHA
    6d8df7d View commit details
    Browse the repository at this point in the history
  41. Mappings must be a separate artifact

    wjohnson authored and hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    c2bf960 View commit details
    Browse the repository at this point in the history
  42. Implementing Cosmos support

    Rebased to include updates to newdeploymenttemp.json
    hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    c73398b View commit details
    Browse the repository at this point in the history
  43. Cosmos integration test added, and test-env README updated. TODO: Upd…

    …ate LIMITATIONS, add integration test notebooks, and add integration test for additional inputs + cosmos
    hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    ba6c4aa View commit details
    Browse the repository at this point in the history
  44. Configuration menu
    Copy the full SHA
    7f24fa2 View commit details
    Browse the repository at this point in the history
  45. Cosmos support WIP

    hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    9f3714c View commit details
    Browse the repository at this point in the history
  46. Cosmos WIP

    hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    b67bbe1 View commit details
    Browse the repository at this point in the history
  47. Cosmos WIP

    hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    50f3129 View commit details
    Browse the repository at this point in the history
  48. Update LIMITATIONS

    hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    00b18d4 View commit details
    Browse the repository at this point in the history
  49. Updated UnitTestData, test CompleteNoOutputsInputsFullMessage expecte…

    …d result to True, as now considering COMPLETE events with only outputs
    hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    247e56a View commit details
    Browse the repository at this point in the history
  50. Configuration menu
    Copy the full SHA
    c3bbfdd View commit details
    Browse the repository at this point in the history
  51. Cosmos WIP

    hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    e008feb View commit details
    Browse the repository at this point in the history
  52. Unsuccessful debugging

    hmoazam committed Apr 28, 2023
    Configuration menu
    Copy the full SHA
    2da877e View commit details
    Browse the repository at this point in the history
  53. Configuration menu
    Copy the full SHA
    edd5657 View commit details
    Browse the repository at this point in the history