Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

🐛Bug: Problem with special Characters in Stream Names #544

Closed
tinomerl opened this issue Dec 2, 2024 · 0 comments · Fixed by #545
Closed

🐛Bug: Problem with special Characters in Stream Names #544

tinomerl opened this issue Dec 2, 2024 · 0 comments · Fixed by #545

Comments

@tinomerl
Copy link
Contributor

tinomerl commented Dec 2, 2024

Preface: This isn't directly a PyAirbyte Bug and more of an OS Bug. Im using Ubuntu 22.04 on WSL.

Description

In some systems custom Tables or the like are possible. If one of these custom tables has a special character in it's name the jsonl.gz file, used to import everything into the DuckDB cache, cannot be created. Therefore the whole sync fails. We have a case where a custom table in a customer's system which contains a backward slash \. Since we are using a custom connector for this customer and it's the system of said customer, i'm not able to share logs. maybe at a later stage we can release the connector to the main airbyte repo.

Possible Fix

In file_writers.py the method _get_new_cache_file_path generates a unique name by adding a uuid / batch_id. By my understanding it should be possible to add a line which would remove special characters in the jsonl.gz file.

Willing to create PR

Yes. Eager to try something out.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant