PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
-
Updated
Jul 7, 2024 - Python
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
A repository containing comprehensive data on real estate property transactions, encompassing transaction details, property characteristics, and market insights for analytical purposes in the real estate industry.
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
Singer Tap for dbt API v2 built with the Meltano SDK
Singer tap for the StackExchange API
Crawly, a high-level web crawling & scraping framework for Elixir.
Extracting Data Of Scanned Images
Python program which extracts some data from a specific Word document used in my company. Without this program data used to be extracted manually, opening hundred of Word documents one by one to copy/past some informations on an Excel file. Now it is fully automatic.
This example demonstrates how to update the extract data file at runtime.
This example demonstrates how to create the Extract data source, replace existing dashboard data sources with Extract data sources and update the Extract data file.
Web Crawler/Spider for NodeJS + server-side jQuery ;-)
Export definitions, and notes regarding how they work, for extracting data from MySchoolSask (an implementation of Follett Aspen)
🇺🇸 Solution for importing and analyzing public Brazilian business data (CNPJ). 🇧🇷 Processamento de Dados CNPJ: Uma solução robusta e conteinerizada para importação e análise de dados empresariais brasileiros (CNPJ).
This is a repo to face recognition on 5 famous people
PDFix SDK samples for Java Maven. PDF manipulation, content extraction, conversion , accessibility and more...
Easily scrape 10,000+ email messages in one hour, helping you quickly increase your customers Extracts data from (LinkedIn, Facebook, Instagram, Youtube, Pinterest, Twitter) Perfect search by specific Keywords Ready-to-use Social Network Data Scraper Software to get started instantly 100% Include source code and install file
Basic data extraction from website GEIPAN
Extracts data from a spreadsheet and outputs its contents to a '.SQL' file. Data extraction tool useful for people using SQL Server Express with no access to SSMS addon and import wizard.
Make PDF Files Accessible, Extract Data from PDF, Convert PDF to HTML, Fill-in PDF Form, Stamp PDF and more...
Add a description, image, and links to the extract-data topic page so that developers can more easily learn about it.
To associate your repository with the extract-data topic, visit your repo's landing page and select "manage topics."