- patch for
local_db()
by defaulting path to subdir. - update roxygen
- Added window-parallel option for ark'ing large tables in parallel
- More conditional testing on M1/arm Mac
- Patch for test suite for Solaris.
arrow
package installs on Solaris, but functions do not actually run correctly since the C++ libraries have not been set up properly on Solaris.
- Added ability to name output files directly.
- Add warning when users specify compression for parquet files.
- Added callback functionality to the
ark
function. Allowing users to perform transformations or recodes before chunked data.frames are saved to disk. - Added ability to filter databases by allowing users to specify a "WHERE" clause.
- Added parquet as an streamable_table format, allowing users to
ark
to parquet instead of a text format.
- Bugfix for arkdb
- Make cached connection opt-out instead of applying only to read_only. This allows cache to work on read-write connections by default. This also avoids the condition of a connection being garbage-collected when functions call local_db internally.
- Better handling of read_only vs read_write connections. Only caches read_only connections.
- Includes optional support for MonetDBLite
- Bugfix for dplyr 2.0.0 release
- Bugfix for upcoming dplyr 2.0.0 release
- Support vroom as an opt-in streamable table
- Export
process_chunks
- Add mechanism to attempt a bulk importer, when available (#27)
- Bugfix for case when text contains
#
characters in base parser (#28) - Lighten core dependencies. Fully recursive dependencies include only 4
non-base packages now, as
progress
is now optional. - Use "magic numbers" instead of extensions to guess compression type. (NOTE: requires that file is local and not a URL)
- Now that
duckdb
is on CRAN andMonetDBLite
isn't, drop built-in support forMonetDBLite
in favor ofduckdb
alone.
ark()
's defaultkeep-open
method would cut off header names for Postgres connections (due to variation in the behavior of SQL queries withLIMIT 0
.) The issue is now resolved by accessing the header in a more robust, general way.
unark()
will strip out non-compliant characters in table names by default.unark()
gains the optional argumenttablenames
, allowing the user to specify the corresponding table names manually, rather than enforcing they correspond with the incoming file names. #18unark()
gains the argumentencoding
, allowing users to directly set the encoding of incoming files. Previously this could only be set by settingoptions(encoding)
, which will still work as well. SeeFAO.R
example inexamples
for an illustration.unark()
will now attempt to guess which streaming parser to use (e.gcsv
ortsv
) based on the file extension pattern, rather than defaulting to atsv
parser. (ark()
still defaults to exporting in the more portabletsv
format).
- Remove dependency on utils::askYesNo for backward compatibility, #17
- Ensure the suggested dependency MonetDBLite is available before running unit test using it.
- Overwrite existing tables of same name (with warning and interactive proceed) in both DB and text-files to avoid appending.