AndreaBozzo/README.md

👋 Andrea Bozzo

Senior Data Architect | Systems Programmer | Open Source Creator

My path into technology wasn't a straight line, but an inevitable evolution. I've spent my life as a problem-solver, and with a keyboard in front of me for as long as I can remember, programming became my natural language for building solutions.

This journey took me from a background as a Chartered Accountant, through data analysis, and ultimately deep into systems engineering. On my blog, I share technical deep-dives into the projects I contribute to.

Expect complex jargon and unfiltered details, as it's written for fellow builders.

I have a few personal projects like dataprof and, more recently, Ceres.

🌐 Landing Page • 📝 Blog • 💼 LinkedIn • 📧 Email


πŸ› οΈ Tech Stack

  • Core Languages: Rust, Python
  • Data & Architecture: Data Lakehouse, Apache Arrow, PostgreSQL
  • Cloud & DevOps: Azure, Docker, GitHub Actions


🌟 Open Source Contributions

Contributing to the broader open source ecosystem beyond my own projects.

This section is updated automatically every day via GitHub Actions.

  • pola-rs/polars ⭐ 36471 - 2 merged PRs
    • Extremely fast Query Engine for DataFrames, written in Rust
  • risingwavelabs/risingwave ⭐ 8586 - 1 merged PR
    • Streaming data platform. Real-time stream processing, low-latency serving, and Iceberg table management.
  • supabase/etl ⭐ 2060 - 1 merged PR
    • Stream your Postgres data anywhere in real-time. Simple Rust building blocks for change data capture (CDC) pipelines.
  • datapizza-labs/datapizza-ai ⭐ 2031 - 3 merged PRs
    • Build reliable Gen AI solutions without overhead 🍕
  • mariocandela/beelzebub ⭐ 1718 - 1 merged PR
    • A secure low code honeypot framework, leveraging AI for System Virtualization.
  • lakekeeper/lakekeeper ⭐ 1077 - 2 merged PRs
    • Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.
  • italia-opensource/awesome-italia-opensource ⭐ 311 - 1 merged PR
    • Italian Open-Source is the first platform dedicated to the Italian open-source world
  • pganalyze/pg_query.rs ⭐ 205 - 1 merged PR
    • Parse, deparse and normalize SQL queries using the Postgres source code
  • CortexFlow/CortexBrain ⭐ 67 - 3 merged PRs
    • CortexBrain is an ambitious open-source project created by CortexFlow, aiming to develop an intelligent, lightweight, and efficient service mesh architecture that seamlessly connects cloud and edge devices
  • piopy/fantacalcio-py ⭐ 41 - 4 merged PRs
    • A small tool to guide us through the auction while spending little

🤝 Let's Connect

I'm always open to connecting with fellow developers and data enthusiasts. Let's talk about data engineering, system architecture, or your favorite open-source project.

  • Ask me about: Data pipelines, ETL design, Rust best practices, and system architecture.
  • Looking to collaborate on: Data engineering tools, high-performance libraries (Rust/Python), and interesting open-source projects.
  • Fun fact: I still write SQL, but now I optimize the engine running it.

Pinned

  1. Ceres Public

    Semantic search engine for open data portals built with tokio and pgvector.

    Rust 1

  2. dataprof Public

    Fast, reliable data quality assessment for CSV, Parquet, and databases

    Rust 8 1

  3. Peek-a-Boo Public

    Minimalist AI agent that extracts information from files using surgical grep/ls operations to minimize token usage. Built with Datapizza + Google Gemini.

    Python

  4. rust-ita/rust-docs-it Public

    Rust documentation translated into Italian

    Shell 2 1

  5. Osservatorio Public archive

    Osservatorio - Open Data Processing Platform (WIP)

    Python 5 5