Closed
Commits
545 commits
51a3c88
ARROW-10127: Update specification for Decimal to allow for 256-bits
emkornfield Oct 3, 2020
d0579ac
ARROW-9753: [Rust] [DataFusion] Replaced Arc<Mutex<>> by Box<>
jorgecarleitao Oct 3, 2020
334e33e
ARROW-9520: [Rust] [DataFusion] Add support for aliased aggregate exprs
returnString Oct 3, 2020
b73731b
ARROW-10161: [Rust] [DataFusion] DRYed code in tests
jorgecarleitao Oct 3, 2020
df91582
ARROW-4927: [Rust] Update top level README to describe current functi…
jorgecarleitao Oct 3, 2020
551edca
ARROW-10169: [Rust] Pretty print null PrimitiveTypes as empty strings
alamb Oct 3, 2020
57f548c
ARROW-10155: [Rust] [DataFusion] Improved lib.rs docs
jorgecarleitao Oct 4, 2020
46c75ac
ARROW-10162: [Rust] Add pretty print support for DictionaryArray
alamb Oct 4, 2020
a6b30de
ARROW-10171: [Rust] [DataFusion] Added ExecutionContext::From<Executi…
jorgecarleitao Oct 4, 2020
8c75941
ARROW-9701: [CI][Java] Add a job for s390x Java on TravisCI
kiszk Oct 5, 2020
bb553c4
ARROW-10156: [Rust] Added github action to label PRs for rust.
jorgecarleitao Oct 5, 2020
4555fb9
ARROW-9786: [R] Unvendor cpp11 before release
romainfrancois Oct 5, 2020
7f84722
ARROW-9941: [Python] Better string representation for extension types
dianaclarke Oct 5, 2020
fbb781b
ARROW-8394: [JS] Upgrade to TypeScript 4.0.2, fix typings for TS 3.9+
trxcllnt Oct 5, 2020
68a7ba5
ARROW-8735: [Rust] [Parquet] Allow arm 32 to use soft hash implementa…
hannah-stevenson-frequenz Oct 5, 2020
ecc3ed8
ARROW-10008: [C++][Dataset] Fix filtering/row group statistics of dic…
bkietz Oct 5, 2020
72a0e96
ARROW-10121: [C++] Fix emission of new dictionaries in IPC writer
pitrou Oct 5, 2020
105873e
ARROW-10068: [C++] Add bundled external project for aws-sdk-cpp
nealrichardson Oct 5, 2020
b2842ab
ARROW-10147: [Python] Pandas metadata fails if index name not JSON-se…
dianaclarke Oct 5, 2020
ffaed97
ARROW-8205: [Rust] [DataFusion] Added check to uniqueness of column n…
jorgecarleitao Oct 6, 2020
88b0cd2
ARROW-10041: [Rust] Added check of data type to GenericString::from.
jorgecarleitao Oct 6, 2020
b17cea8
ARROW-10179: [Rust] Fixed error in labeler
jorgecarleitao Oct 6, 2020
c5d998c
ARROW-10189: [Doc] Fixed typo in C-Data interface example
jorgecarleitao Oct 6, 2020
adcb8ef
ARROW-10188: [Rust] [DataFusion] Fixed DataFusion examples.
jorgecarleitao Oct 6, 2020
a880ae6
ARROW-10167: [Rust] [DataFusion] Support DictionaryArray in sql.rs te…
alamb Oct 6, 2020
b1480a6
ARROW-10120: [C++] Add two-level nested Parquet read to Arrow benchmarks
pitrou Oct 6, 2020
c6e0fa4
ARROW-10193: [Python] Segfault when converting to fixed size binary a…
kszucs Oct 6, 2020
814c681
ARROW-10192: [Python] Always decode inner dictionaries when convertin…
pitrou Oct 6, 2020
bca04cd
ARROW-10176: [C++] Avoid using unformattable types for test parameters
bkietz Oct 6, 2020
03dea9b
ARROW-10200: [CI][Java] Fix a job failure for s390x Java on TravisCI
kiszk Oct 6, 2020
8a03003
ARROW-10201: [C++][CI] Disable S3 in arm64 job on Travis CI
kou Oct 7, 2020
e9a12fa
ARROW-10058: [C++] Improve repeated levels conversion without BMI2
pitrou Oct 7, 2020
e9a0ada
ARROW-10202: [CI][Windows] Use sf.net mirror for MSYS2
kou Oct 7, 2020
47a019b
ARROW-10178: [CI] Remove patch to fix Spark master build
BryanCutler Oct 7, 2020
aeef2a6
ARROW-9147: [C++][Dataset] Support projection from null->any type
bkietz Oct 7, 2020
1a2d048
ARROW-10151: [Python] Add support for MapArray conversion to Pandas
BryanCutler Oct 7, 2020
ef08a9d
ARROW-9943: [C++] Recursively apply Arrow metadata when reading from …
pitrou Oct 7, 2020
6a8f84d
ARROW-10205: [Java][FlightRPC] Allow disabling server validation
jduo Oct 7, 2020
8eea173
ARROW-7960: [C++] Add support for reading additional types
emkornfield Oct 7, 2020
a28b1fb
ARROW-10093: [R] Add ability to opt-out of int64 -> int demotion
romainfrancois Oct 7, 2020
eec7277
ARROW-10214: [Python] Allow printing undecodable schema metadata
pitrou Oct 7, 2020
a3b5bcd
ARROW-10217: [CI] Run fewer GitHub Actions jobs
nealrichardson Oct 7, 2020
62269e9
ARROW-10030: [Rust] Add support for `FromIter` and `IntoIter` for pri…
jorgecarleitao Oct 7, 2020
1c9c718
ARROW-10181: [Rust] Skip compiling one test on 32 bit ARM architecture
andygrove Oct 7, 2020
ad51ac2
ARROW-10196: [C++] Add Future::DeferNotOk
bkietz Oct 7, 2020
25d131b
ARROW-9645: [Python] Deprecate pyarrow.filesystem in favor of pyarrow.fs
jorisvandenbossche Oct 7, 2020
2a49df6
Fix YAML syntax in dev_labeler
nealrichardson Oct 7, 2020
0f56c60
ARROW-9964: [C++] Allow reading date types from CSV data
pitrou Oct 7, 2020
806b0ce
ARROW-6972: [C#] Support for StructArrays
Oct 7, 2020
9526558
ARROW-10204: [Rust] Filter kernel should only count bits in valid range
jhorstmann Oct 7, 2020
732e333
ARROW-9266: [Python][Packaging] Enable S3 support in macOS wheels
kszucs Oct 7, 2020
8366799
ARROW-10099: [C++][Dataset] Simplify type inference for partition col…
bkietz Oct 7, 2020
4bab70f
ARROW-10139: [C++] Add support for building arrow_testing without bui…
kou Oct 7, 2020
ae396b9
ARROW-9782: [C++][Dataset] More configurable Dataset writing
bkietz Oct 8, 2020
1150c38
ARROW-10134: [Python][Dataset] Add ParquetFileFragment.num_row_groups
bkietz Oct 8, 2020
878c534
ARROW-10227: [Ruby] Use a table size as the default for parquet chunk…
kanga333 Oct 8, 2020
d4cbc4b
ARROW-10229: [C++] Remove errant log line
emkornfield Oct 8, 2020
54199ec
ARROW-10023: [C++][Gandiva] Implement split_part function in gandiva
Oct 8, 2020
ba7ee65
ARROW-9967: [Python] Add compute module docs + expose more option cla…
arw2019 Oct 8, 2020
20f2bd4
ARROW-10040: [Rust] Iterate over and combine boolean buffers with arb…
jhorstmann Oct 8, 2020
8447bb1
ARROW-10235: [Rust][DataFusion] Improve documentation for type coercion
alamb Oct 8, 2020
7490391
ARROW-6537 [R]: Pass column_types to CSV reader
romainfrancois Oct 8, 2020
4bbb747
ARROW-10233: [Rust] Make array_value_to_string available in all Arrow…
alamb Oct 8, 2020
945f649
ARROW-9414: [Packaging][deb][RPM] Enable S3
kou Oct 8, 2020
0100121
ARROW-10015: [Rust] Simd aggregate kernels
jhorstmann Oct 8, 2020
1c7581c
ARROW-10043: [Rust][DataFusion] Implement COUNT(DISTINCT col)
drusso Oct 8, 2020
4c101ef
ARROW-10164: [Rust] Add support for DictionaryArray to cast kernel
alamb Oct 8, 2020
beb031f
ARROW-10238: [C#] List<Struct> is broken
eerhardt Oct 9, 2020
109f701
ARROW-10237: [C++] Duplicate dict values cause corrupt parquet
bkietz Oct 9, 2020
f0f7593
ARROW-10239: [C++] Add missing zlib dependency to aws-sdk-cpp
kou Oct 9, 2020
d908bc8
ARROW-9879: [Python] Add support for numpy scalars to ChunkedArray.__…
xhochy Oct 9, 2020
f2ad6a9
ARROW-9956: [C++] [Gandiva] Implementation of binary_string function …
Oct 9, 2020
599b458
ARROW-9518: [Python] Deprecate pyarrow serialization
jorisvandenbossche Oct 9, 2020
b8b4419
ARROW-8355: [Python] Remove hard pandas dependency from FeatherDatas…
arw2019 Oct 9, 2020
97a38a8
ARROW-10231: [CI] Unable to download minio in arm32v7 docker image
kszucs Oct 9, 2020
4d2cf9f
ARROW-10175: [CI] Fix nightly HDFS integration tests (ensure to use l…
jorisvandenbossche Oct 9, 2020
9e587be
ARROW-10206: [C++][Python][FlightRPC] Allow disabling server validation
jduo Oct 9, 2020
20cd08a
ARROW-10215: [Rust] [DataFusion] Renamed Source to SendableRecordBatc…
jorgecarleitao Oct 9, 2020
008578e
ARROW-10114: [R] Segfault in to_dataframe_parallel with deeply nested…
romainfrancois Oct 9, 2020
883eb57
ARROW-9870: [R] Friendly interface for filesystems (S3)
nealrichardson Oct 9, 2020
3694794
ARROW-8296: [C++][Dataset] Add IpcFileWriteOptions
bkietz Oct 9, 2020
a6cbffe
ARROW-10257: [R] Prepare news/docs for 2.0 release
nealrichardson Oct 10, 2020
f405075
ARROW-9962: [Python] Fix conversion to_pandas with tz-aware index col…
jorisvandenbossche Oct 10, 2020
cf1b9f6
ARROW-10244: [Python] Document pyarrow.dataset.parquet_dataset
jorisvandenbossche Oct 10, 2020
9a8bd30
ARROW-10248: [Python][Dataset] Always apply Python's default write pr…
bkietz Oct 10, 2020
4679c7c
ARROW-3080: [Python] Unify Arrow to Python object conversion paths
kszucs Oct 10, 2020
ad45ed5
ARROW-10230: [JS][Doc] JavaScript documentation fails to build
kszucs Oct 10, 2020
d8af4fa
ARROW-10252: [Python] Add option to skip inclusion of Arrow headers i…
xhochy Oct 10, 2020
a86be66
ARROW-10260: [Python] Missing MapType in to_pandas_dtype()
dmarsh19 Oct 10, 2020
cd2179f
ARROW-10256: [C++][Flight] Disable -Werror carefully
kou Oct 10, 2020
5324d98
ARROW-9952: [Python] Optionally use pyarrow.dataset in parquet.write_…
jorisvandenbossche Oct 10, 2020
0a105a3
ARROW-10265: [CI] Use smaller build when cache doesn't exist on Travi…
kou Oct 10, 2020
9d90eb4
ARROW-10266: [CI][macOS] Ensure using Python 3.8 with Homebrew
kou Oct 10, 2020
4a6735a
ARROW-10267: [Python] Skip flight test if disable_server_verification…
kszucs Oct 10, 2020
79c83b2
ARROW-10240: [Rust] Optionally load data into memory before running b…
jhorstmann Oct 10, 2020
0f06c99
ARROW-7957: [Python] Handle new FileSystem in ParquetDataset by autom…
jorisvandenbossche Oct 10, 2020
8178b01
ARROW-10100: [C++][Python][Dataset] Add ParquetFileFragment::Subset m…
jorisvandenbossche Oct 10, 2020
c25fd66
ARROW-9553: [Rust] Release script doesn't bump parquet crate's arrow …
kszucs Oct 10, 2020
ba0f7f1
ARROW-10272: [Packaging][Python] Pin newer multibuild version to avoi…
kszucs Oct 10, 2020
6428aa5
ARROW-10251: [Rust] [DataFusion] MemTable::load() now loads partition…
andygrove Oct 11, 2020
dcc86f4
ARROW-10271: [Rust] Update dependencies
nevi-me Oct 11, 2020
8bbfdc4
ARROW-10234: [C++][Gandiva] Fix logic of round() for floats/decimals …
sgnkc Oct 12, 2020
0b48152
ARROW-10262: [C++] Fix TypeClass for BinaryScalar and LargeBinaryScalar
zanmato1984 Oct 12, 2020
330396a
ARROW-10286: [C++][FlightRPC] Make CMake output less confusing
lidavidm Oct 12, 2020
493d2c6
ARROW-10288: [C++] Fix compilation errors on 32-bit x86
pitrou Oct 12, 2020
e9bb109
ARROW-10287: [C++] Avoid std::random_device
pitrou Oct 12, 2020
e6eb61f
ARROW-10281: [Python] Fix warnings when running tests
jorisvandenbossche Oct 12, 2020
0419c26
ARROW-10284: [Python] Correctly suppress warning about legacy filesys…
jorisvandenbossche Oct 12, 2020
12281c4
ARROW-10285: [Python] Fix usage of deprecated num_children in pyarrow…
jorisvandenbossche Oct 12, 2020
d8f8d46
ARROW-10280: [Packaging][Python] Fix macOS wheel artifact patterns
kszucs Oct 12, 2020
da89ad1
ARROW-10273: [CI][Homebrew] Fix "brew audit" usage
kou Oct 12, 2020
6ce2793
ARROW-10279: [Release][Python] Fix verification script to align with …
kszucs Oct 12, 2020
70ae161
ARROW-10290: [C++] List POP_BACK is not available in older CMake vers…
kszucs Oct 12, 2020
0aa2069
[Release] Update CHANGELOG.md for 2.0.0
kszucs Oct 12, 2020
e46a3c6
[Release] Update .deb/.rpm changelogs for 2.0.0
kszucs Oct 12, 2020
5943421
[Release] Update versions for 2.0.0
kszucs Oct 12, 2020
4782866
[maven-release-plugin] prepare release apache-arrow-2.0.0
kszucs Oct 12, 2020
b1f36ac
[Release] Update versions for 3.0.0-SNAPSHOT
kszucs Oct 19, 2020
f72575c
[Release] Update .deb package names for 3.0.0
kszucs Oct 19, 2020
a3a35b2
ARROW-10263: [C++][Compute] Improve variance kernel numerical stability
cyb70289 Oct 13, 2020
91b5f07
ARROW-10293: [Rust] [DataFusion] Fixed benchmarks
jorgecarleitao Oct 13, 2020
a030fc5
ARROW-10295 [Rust] [DataFusion] Replace Rc<RefCell<>> by Box<> in acc…
jorgecarleitao Oct 13, 2020
34533b6
ARROW-10289: [Rust] Read dictionaries in IPC streams
nevi-me Oct 14, 2020
7209ffc
ARROW-10292: [Rust] [DataFusion] Simplify merge
jorgecarleitao Oct 14, 2020
9e671ac
ARROW-10270: [R] Fix CSV timestamp_parsers test on R-devel
nealrichardson Oct 14, 2020
8f302d3
ARROW-9479: [JS] Fix Table.from for zero-item serialized tables, Tabl…
H-Plus-Time Oct 15, 2020
03c7c02
ARROW-10145: [C++][Dataset] Assert integer overflow in partitioning f…
bkietz Oct 15, 2020
a7ef5d2
ARROW-10174: [Java] Fix reading/writing dict structs
HedgehogCode Oct 16, 2020
3f69ad2
ARROW-10236: [Rust] Add can_cast_types to arrow cast kernel, use in D…
alamb Oct 16, 2020
22027c7
ARROW-10294: [Java] Resolve problems of DecimalVector APIs on ArrowBufs
liyafan82 Oct 16, 2020
cb58146
ARROW-9475: [Java] Clean up usages of BaseAllocator, use BufferAlloca…
zhztheplayer Oct 16, 2020
3f96cc0
ARROW-10313: [C++] Faster UTF8 validation for small strings
pitrou Oct 16, 2020
36bf7a4
ARROW-9898: [C++][Gandiva] Fix linking issue with castINT/FLOAT funct…
projjal Oct 16, 2020
2fe8b86
ARROW-10311: [Release] Update crossbow verification process
kszucs Oct 16, 2020
3dbea74
ARROW-10321: [C++] Use check_cxx_source_compiles for AVX512 detect in…
frankdjx Oct 16, 2020
be21209
ARROW-5350: [Rust] Allow filtering on simple lists
nevi-me Oct 18, 2020
29e9d13
ARROW-10002: [Rust] Remove trait specialization from arrow crate
jorgecarleitao Oct 18, 2020
ad2a002
ARROW-10163: [Rust] [DataFusion] Add DictionaryArray coercion support
alamb Oct 18, 2020
b653922
ARROW-10323: [Release][wheel] Add missing verification setup step
kou Oct 19, 2020
5af12b0
ARROW-10340: [Packaging][deb][RPM] Use Python 3.8 for pygit2
kou Oct 19, 2020
6ad24c2
ARROW-10310: [C++][Gandiva] Add single argument round() in Gandiva
sgnkc Oct 19, 2020
139fb51
ARROW-10241: [C++][Compute] Add variance kernel benchmark
cyb70289 Oct 19, 2020
457935e
ARROW-10106: [FlightRPC][Java] Expose onIsReady() callback
jduo Oct 19, 2020
7944265
ARROW-10203: [Doc] Give guidance on big-endian support in the contrib…
emkornfield Oct 19, 2020
0e13e28
ARROW-9164: [C++] Add embedded documentation to compute functions
pitrou Oct 19, 2020
445251a
ARROW-9991: [C++] Split kernels for strings/binary
maartenbreddels Oct 19, 2020
04660f8
ARROW-10348: [C++] Fix crash on invalid Parquet data
pitrou Oct 19, 2020
7696b89
ARROW-10338: [Rust] Use const fn for applicable methods
vertexclique Oct 20, 2020
2321542
ARROW-10318: [C++] Use pimpl idiom in CSV parser
pitrou Oct 20, 2020
84e4b15
ARROW-10320 [Rust] [DataFusion] Migrated from batch iterators to batc…
jorgecarleitao Oct 20, 2020
760284f
ARROW-10358: [R] Followups to 2.0.0 release
nealrichardson Oct 20, 2020
f06c68a
ARROW-10331: [Rust] [DataFusion] Re-organize DataFusion errors
jorgecarleitao Oct 21, 2020
bb4f2a0
ARROW-9963: [Python] Recognize datetime.timezone.utc as UTC on conver…
jorisvandenbossche Oct 21, 2020
16412a1
ARROW-10328: [C++] Vendor fast_float number parsing library
pitrou Oct 21, 2020
625bf3f
ARROW-10363: [Python] Remove CMake bug workaround in manylinux
pitrou Oct 22, 2020
e2d8dc3
ARROW-10304: [C++][Compute] Optimize variance kernel for integers
cyb70289 Oct 22, 2020
5ad3c98
ARROW-10369: [Dev] Fix archery release utility test cases
kszucs Oct 22, 2020
843e8bb
ARROW-9304: [C++] Add "AppendEmpty" builder APIs for use inside Struc…
tianchen92 Oct 22, 2020
103489b
ARROW-10208: [C++] Fix split string kernels on sliced input
pitrou Oct 22, 2020
39a7539
ARROW-10207: [C++] Allow precomputing output string/list offsets in k…
pitrou Oct 22, 2020
982ca83
ARROW-10365: [R] Remove duplicate setting of S3 flag on macOS
nealrichardson Oct 22, 2020
9cb0e9f
ARROW-10249: [Rust] Support nested dictionaries inside list arrays
vertexclique Oct 23, 2020
b3992ea
ARROW-10333: [Java] Get rid of org.apache.arrow.util in vector
HedgehogCode Oct 23, 2020
590541d
ARROW-9747: [Java][C++] Initial Support for 256-bit Decimals
MingyuZhong Oct 23, 2020
c49b5da
ARROW-10302: [Python] Don't double-package plasma-store-server
xhochy Oct 23, 2020
eea8d35
ARROW-10375 [Rust] Removed PrimitiveArrayOps
jorgecarleitao Oct 23, 2020
da50838
ARROW-10383: [Doc] fix typos
kiszk Oct 24, 2020
aa2e6d4
ARROW-10384: [C++] Fix typos
kiszk Oct 24, 2020
221de70
ARROW-10382: [Rust] Fix typos
kiszk Oct 24, 2020
a764d3b
ARROW-10332: [Rust] Allow CSV reader to iterate from start up to end
jorgecarleitao Oct 25, 2020
7155cd5
ARROW-10135: [Rust] [Parquet] Refactor file module to help adding sou…
rdettai Oct 25, 2020
f6501a5
ARROW-10319: [Go][Flight] Add context to flight client auth handler
Oct 26, 2020
06d4f17
ARROW-10390: [Rust][Parquet] Ensure it is possible to create custom p…
alamb Oct 26, 2020
9d424b5
ARROW-10394 [Rust] Validate datatype on [Large]BinaryArray creation
jorgecarleitao Oct 27, 2020
e22820f
ARROW-10336: [Rust] Added FromIter and ToIter for string arrays
jorgecarleitao Oct 27, 2020
cf61fa1
ARROW-10385: [C++][Gandiva] Add support for LLVM 11
kou Oct 27, 2020
9b70673
ARROW-10400: [C++] Propagate TLS client peer_identity when using mutu…
raduteo Oct 27, 2020
c3091dd
ARROW-10399: [R] Fix performance regression from cpp11::r_string
nealrichardson Oct 27, 2020
3ab3b4b
ARROW-10397: [C++] Update comment to match change made in b1a7a73ff2
carols10cents Oct 28, 2020
765f20c
ARROW-10408: [Java] Bump Avro to 1.10.0
Fokko Oct 28, 2020
8513e7f
ARROW-10393: [Rust] Apply fix for null reading in json reader for nested
vertexclique Oct 28, 2020
42cb805
ARROW-10392: [C++][Gandiva] Avoid string copy while evaluating IN exp…
pprudhvi Oct 28, 2020
710b80b
ARROW-8289: [Rust] Parquet Arrow writer with nested support
nevi-me Aug 13, 2020
57be963
ARROW-8423: [Rust] [Parquet] Serialize Arrow schema metadata
nevi-me Aug 18, 2020
cf3861a
ARROW-10095: [Rust] Update rust-parquet-arrow-writer branch's encode_…
carols10cents Sep 25, 2020
9ccb521
ARROW-8426: [Rust] [Parquet] Add support for writing dictionary types
carols10cents Oct 3, 2020
c8a59aa
ARROW-10191: [Rust] [Parquet] Add roundtrip Arrow -> Parquet tests fo…
carols10cents Oct 6, 2020
966a703
ARROW-10168: [Rust] [Parquet] Schema roundtrip - use Arrow schema fro…
carols10cents Oct 7, 2020
90f365d
ARROW-10225: [Rust] [Parquet] Fix null comparison in roundtrip
nevi-me Oct 8, 2020
54ee9fc
ARROW-10334: [Rust] [Parquet] NullArray roundtrip
nevi-me Oct 17, 2020
0d620c1
ARROW-7842: [Rust] [Parquet] Arrow list reader
nevi-me Oct 17, 2020
3cc3a4b
ARROW-8426: [Rust] [Parquet] - Add more support for converting Dicts
carols10cents Oct 27, 2020
38971af
ARROW-10388: [Java] Fix Spark integration build failure
liyafan82 Oct 29, 2020
16f11d5
ARROW-10381: [Rust] Generalized Ordering for inter-array comparisons
jorgecarleitao Oct 29, 2020
9867c48
ARROW-10413: [Rust] [Parquet] Unignore some tests that are passing now
carols10cents Oct 29, 2020
7e503af
ARROW-10389: [Rust] [DataFusion] Make the custom source implementatio…
rdettai Oct 29, 2020
868777d
ARROW-10131: [C++][Dataset][Python] Lazily parse parquet metadata
bkietz Oct 29, 2020
b109195
ARROW-10080: [R] Call gc() and try again in MemoryPool
bkietz Oct 30, 2020
687dcea
ARROW-9704: [Java] TestEndianness.testLittleEndian supports little- a…
kiszk Oct 30, 2020
792ea98
ARROW-10410: [Rust] Some refactorings
Oct 31, 2020
cc54326
ARROW-10396: [Rust] [Parquet] Publically export SliceableCursor and F…
alamb Oct 31, 2020
e7ce8cf
ARROW-10398 [Rust] [Parquet] Re-Export parquet::record::api::Field
seen Oct 31, 2020
9231976
[C++][Build] Fix build error on s390x (#8563)
kiszk Nov 2, 2020
a8eb736
ARROW-9804: [FlightRPC] Flight auth redesign
jduo Nov 2, 2020
9552486
ARROW-9709: [Java] Test cases in arrow-vector takes care of endianness
kiszk Nov 2, 2020
86997e7
ARROW-10424: [Rust] Minor simplification to the generic impl Primitiv…
jorgecarleitao Nov 2, 2020
08993ea
ARROW-10434: [Rust] Fix debug formatting for arrays with lengths betw…
drusso Nov 2, 2020
0573675
ARROW-10454: [Rust] [Datafusion] support creating ParquetExec from fi…
Nov 2, 2020
9f13810
ARROW-10458: [Rust] [Datafusion] create_logical_plan should not requi…
houqp Nov 2, 2020
145ff87
ARROW-10446: [C++][Python] Roundtrip Timestamp ns with TzInfo correctly
emkornfield Nov 3, 2020
959e8c5
ARROW-10300: [Rust] Improve documentation for TPC-H benchmark
andygrove Nov 3, 2020
c02ed53
ARROW-10422: [Rust] Removed unused trait BinaryArrayBuilder
jorgecarleitao Nov 3, 2020
3a87324
ARROW-10478: [Dev][Release] Correct Java versions to 3.0.0-SNAPSHOT
jduo Nov 3, 2020
b4ddc96
ARROW-10490: [C++][GLib] Fix range-loop-analysis warnings
kou Nov 4, 2020
913cd76
ARROW-9897: [C++][Gandiva] Revert - to_date function
sgnkc Nov 4, 2020
5355f7c
ARROW-9861: [Java] Support big-endian in DecimalVector
kiszk Nov 4, 2020
1c223f5
ARROW-10468: [C++][Compute] Provide KernelExecutor instead of Functio…
bkietz Nov 4, 2020
818cafc
ARROW-10496: [R][CI] Fix conda-r job
nealrichardson Nov 4, 2020
bcbcead
ARROW-10371: [R] Linux system requirements check needs to support old…
nealrichardson Nov 4, 2020
3eb6e69
ARROW-10441: [Java] Prevent closure of shared channels for FlightClient
kylep-dremio Nov 4, 2020
c819964
ARROW-10491: [FlightRPC][Java] Fix NPE when using makeContext
jduo Nov 4, 2020
d4121d8
ARROW-10495: [Packaging][deb] Move FindRE2.cmake to libarrow-dev
kou Nov 5, 2020
7e740b8
ARROW-4960: [R] Build r-arrow conda package in crossbow
xhochy Nov 5, 2020
15254f9
ARROW-10504: [C++] Suppress UBSAN pointer-overflow warning in RapidJSON
bkietz Nov 5, 2020
cb4f26d
ARROW-10499: [C++][Java] Fix ORC Java JNI Crash
terencehonles Nov 5, 2020
e79a481
ARROW-10489: [C++] Add support for intel compiler
jensenrichardson Nov 5, 2020
78da3e6
ARROW-10502: [C++/Python] CUDA detection messes up nightly conda-win …
xhochy Nov 5, 2020
b26478a
ARROW-10467: [FlightRPC][Java] Add the ability to pass arbitrary clie…
kylep-dremio Nov 6, 2020
ddda960
ARROW-10482: [Python] Fix compression per column in Parquet writing
jorisvandenbossche Nov 6, 2020
47f2e0c
ARROW-10428: [FlightRPC][Java] Add support for HTTP cookies
jduo Nov 6, 2020
e6366dc
ARROW-10378: [Rust] Update take() kernel with support for LargeList.
drusso Nov 7, 2020
39c187d
ARROW-10477: [Rust] Add iterator support for Binary arrays.
jorgecarleitao Nov 7, 2020
61f5a66
ARROW-10476: [Rust] Allow string arrays to be built from Option<&str>…
jorgecarleitao Nov 7, 2020
b1dfd96
ARROW-10261: [Rust] [Breaking] Change List datatype to Box<Field>
nevi-me Nov 7, 2020
2e284f4
ARROW-10449 [Rust] Make Dictionary::keys be an array
jorgecarleitao Nov 7, 2020
4dfbc8b
ARROW-10448: [Rust] Remove PrimitiveArray::new that can cause UB
jorgecarleitao Nov 7, 2020
130f6a0
ARROW-10445: [Rust] Added doubleEnded iterator to PrimitiveArrayIter
jorgecarleitao Nov 7, 2020
eb42c50
ARROW-10510: [Rust] [DataFusion] Benchmark COUNT(DISTINCT) queries.
drusso Nov 7, 2020
a04a15a
ARROW-10402: [Rust] Refactor array equality
jorgecarleitao Nov 7, 2020
fb91d53
ARROW-10228: [Julia] Contribute Julia implementation
quinnj Nov 8, 2020
a3f97ab
ARROW-10462: [Python] Fix usage of fsspec in ParquetDataset causing p…
jorisvandenbossche Nov 9, 2020
66aad85
ARROW-10471: [CI][Python] Ensure we have tests with s3fs and run thos…
jorisvandenbossche Nov 9, 2020
19d677a
ARROW-10503: [C++] Uriparser will not compile using Intel compiler
jensenrichardson Nov 9, 2020
3767689
ARROW-10509: [C++] Define operator<<(ostream, ParquetException) for c…
xhochy Nov 9, 2020
a9988ee
ARROW-10346: [Python] Ensure tests aren't affected by user-supplied A…
xhochy Nov 9, 2020
eccdd48
ARROW-10325: [C++][Compute] Refine aggregate kernel registration
cyb70289 Nov 9, 2020
995abdc
ARROW-7531: [C++] Reduce header inclusion cost slightly
pitrou Nov 9, 2020
3e72c70
ARROW-5394: [C++][Benchmark] IsIn and IndexIn benchmark for integer a…
david1437 Nov 9, 2020
e7d56ee
ARROW-9361 [Rust] Move array types into their own modules
jorgecarleitao Nov 10, 2020
34ff231
Add a note about being patient after gitbox enable
alamb Nov 10, 2020
2 changes: 2 additions & 0 deletions .dockerignore
@@ -55,6 +55,8 @@
!rust/arrow-flight/Cargo.toml
!rust/parquet/Cargo.toml
!rust/parquet/build.rs
!rust/parquet_derive/Cargo.toml
!rust/parquet_derive_test/Cargo.toml
!rust/datafusion/Cargo.toml
!rust/datafusion/benches
!rust/integration-testing/Cargo.toml
6 changes: 3 additions & 3 deletions .env
@@ -24,13 +24,13 @@ ARCH=amd64
CUDA=9.1
DEBIAN=10
UBUNTU=18.04
FEDORA=32
FEDORA=33
PYTHON=3.6
LLVM=10
LLVM=11
CLANG_TOOLS=8
RUST=nightly-2020-04-22
GO=1.12
NODE=11
NODE=14
MAVEN=3.5.4
JDK=8
PANDAS=latest
5 changes: 3 additions & 2 deletions .github/workflows/archery.yml
@@ -42,9 +42,10 @@ jobs:
uses: actions/checkout@v2
with:
fetch-depth: 0
- name: Fetch Submodules and Tags
- name: Git Fixup
if: ${{ github.event_name == 'pull_request' }}
shell: bash
run: ci/scripts/util_checkout.sh
run: git branch master origin/master
- name: Free Up Disk Space
run: ci/scripts/util_cleanup.sh
- name: Setup Python
9 changes: 5 additions & 4 deletions .github/workflows/comment_bot.yml
@@ -69,7 +69,7 @@ jobs:
git remote add upstream https://github.com/apache/arrow
git fetch upstream
changed() {
git diff --name-only HEAD^..upstream/master | grep -e "$1" >/dev/null 2>&1
git diff --name-only HEAD..upstream/master | grep -e "$1" >/dev/null 2>&1
}
if changed '^r/.*\.R$'; then
echo "::set-env name=R_DOCS::true"
@@ -84,10 +84,11 @@ jobs:
echo "::set-env name=CLANG_FORMAT_R::true"
fi
- name: Run cmake_format
if: false
# TODO: make this work https://issues.apache.org/jira/browse/ARROW-8489
# if: env.CMAKE_FORMAT == 'true' || endsWith(github.event.comment.body, 'everything')
if: env.CMAKE_FORMAT == 'true' || endsWith(github.event.comment.body, 'everything')
run: |
set -ex
export PATH=/home/runner/.local/bin:$PATH
python3 -m pip install --upgrade pip setuptools wheel
python3 -m pip install -r dev/archery/requirements-lint.txt
python3 run-cmake-format.py
- name: Run clang-format on cpp
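The `changed()` helper in the hunk above lists the files that differ between the PR head and `upstream/master` and tests them against a pattern; the hunk itself only moves the left side of the range from `HEAD^` to `HEAD`. A minimal Python sketch of the same check follows. It assumes the path list has already been obtained, and the sample paths are hypothetical rather than taken from this PR.

```python
import re


def changed(paths, pattern):
    """Return True if any changed path matches the regular expression."""
    return any(re.search(pattern, path) for path in paths)


# Hypothetical stand-in for the output of
# `git diff --name-only HEAD..upstream/master`.
sample_paths = ["r/R/dataset.R", "cpp/src/arrow/array.cc"]

# Same pattern the workflow passes to changed() for R sources.
r_docs = changed(sample_paths, r"^r/.*\.R$")
print(r_docs)
```

In the workflow the result decides whether a flag such as `R_DOCS` is set; here it is simply printed.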
40 changes: 26 additions & 14 deletions .github/workflows/cpp.yml
@@ -102,9 +102,11 @@ jobs:
# hosted machines
name: ${{ matrix.title }}
runs-on: ${{ matrix.runner }}
if: github.event_name == 'push'
# TODO(kszucs): re-enable once the self-hosted workers are properly
# registered to github
if: false && github.event_name == 'push'
defaults:
# to use certain environment variables are set by .bashrc an interactive
# To use certain environment variables set by .bashrc, an interactive
# bash shell must be used
run:
shell: bash -i {0}
@@ -119,11 +121,12 @@ jobs:
debian: 10
title: ARM32v7 Debian 10 C++
image: |
-e CPP_MAKE_PARALLELISM=4 \
-e CPP_MAKE_PARALLELISM=2 \
-e CXXFLAGS=-Wno-psabi \
-e ARROW_PARQUET=OFF \
-e ARROW_FLIGHT=OFF \
-e ARROW_GANDIVA=OFF \
-e ARROW_ORC=OFF \
-e CMAKE_ARGS=-DARROW_CPU_FLAG=armv7 \
debian-cpp
arch: 'arm32v7'
@@ -186,23 +189,24 @@ jobs:
strategy:
fail-fast: false
env:
ARROW_BUILD_TESTS: ON
ARROW_DATASET: ON
ARROW_FLIGHT: ON
ARROW_GANDIVA: ON
ARROW_HDFS: ON
ARROW_HOME: /usr/local
ARROW_JEMALLOC: ON
# TODO(kszucs): link error in the tests
ARROW_DATASET: ON
ARROW_ORC: OFF
ARROW_FLIGHT: ON
ARROW_HDFS: ON
ARROW_PLASMA: ON
ARROW_GANDIVA: ON
ARROW_PARQUET: ON
ARROW_WITH_ZLIB: ON
ARROW_WITH_LZ4: ON
ARROW_PLASMA: ON
ARROW_S3: ON
ARROW_WITH_BROTLI: ON
ARROW_WITH_BZ2: ON
ARROW_WITH_ZSTD: ON
ARROW_WITH_LZ4: ON
ARROW_WITH_SNAPPY: ON
ARROW_WITH_BROTLI: ON
ARROW_BUILD_TESTS: ON
ARROW_WITH_ZLIB: ON
ARROW_WITH_ZSTD: ON
steps:
- name: Checkout Arrow
uses: actions/checkout@v2
@@ -319,6 +323,7 @@ jobs:
ARROW_JEMALLOC: OFF
ARROW_PARQUET: ON
ARROW_PYTHON: ON
ARROW_S3: ON
ARROW_USE_GLOG: OFF
ARROW_VERBOSE_THIRDPARTY_BUILD: OFF
ARROW_WITH_BROTLI: ON
@@ -333,7 +338,6 @@ jobs:
CMAKE_ARGS: >-
-DARROW_PACKAGE_PREFIX=/mingw${{ matrix.mingw-n-bits }}
-DBoost_NO_BOOST_CMAKE=ON
CMAKE_GENERATOR: MSYS Makefiles
CMAKE_UNITY_BUILD: ON
steps:
- name: Disable Crash Dialogs
@@ -370,6 +374,14 @@ jobs:
run: |
export CMAKE_BUILD_PARALLEL_LEVEL=$NUMBER_OF_PROCESSORS
ci/scripts/cpp_build.sh "$(pwd)" "$(pwd)/build"
- name: Download MinIO
shell: msys2 {0}
run: |
mkdir -p /usr/local/bin
wget \
--output-document /usr/local/bin/minio.exe \
https://dl.min.io/server/minio/release/windows-amd64/minio.exe
chmod +x /usr/local/bin/minio.exe
- name: Test
shell: msys2 {0}
run: |
10 changes: 5 additions & 5 deletions .github/workflows/cpp_cron.yml
@@ -46,7 +46,7 @@ jobs:
matrix:
name:
- amd64-debian-10-cpp
- amd64-fedora-32-cpp
- amd64-fedora-33-cpp
- amd64-ubuntu-16.04-cpp
- amd64-ubuntu-18.04-cpp
- amd64-ubuntu-18.04-cpp-cmake32
@@ -55,10 +55,10 @@ jobs:
image: debian-cpp
title: AMD64 Debian 10 C++
debian: 10
- name: amd64-fedora-32-cpp
- name: amd64-fedora-33-cpp
image: fedora-cpp
title: AMD64 Fedora 32 C++
fedora: 32
title: AMD64 Fedora 33 C++
fedora: 33
- name: amd64-ubuntu-16.04-cpp
image: ubuntu-cpp
title: AMD64 Ubuntu 16.04 C++
@@ -75,7 +75,7 @@ jobs:
# the defaults here should correspond to the values in .env
ARCH: 'amd64'
DEBIAN: ${{ matrix.debian || 10 }}
FEDORA: ${{ matrix.fedora || 32 }}
FEDORA: ${{ matrix.fedora || 33 }}
UBUNTU: ${{ matrix.ubuntu || 18.04 }}
steps:
- name: Checkout Arrow
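The comment in this hunk says the workflow defaults should correspond to the values in `.env`, which is why `FEDORA` moves to 33 in both files. A rough Python sketch of how that correspondence could be checked is shown below. The helper names `parse_env` and `mismatches` are hypothetical, and the inline values are copied from the `.env` and `cpp_cron.yml` hunks rather than read from the repository.

```python
def parse_env(text):
    """Parse KEY=VALUE lines, skipping blanks and comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key] = value
    return values


# Values as they appear in .env after this change.
ENV_TEXT = """\
DEBIAN=10
UBUNTU=18.04
FEDORA=33
"""

# Fallbacks from the `${{ matrix.x || default }}` expressions in cpp_cron.yml.
WORKFLOW_DEFAULTS = {"DEBIAN": "10", "FEDORA": "33", "UBUNTU": "18.04"}


def mismatches(env, defaults):
    """Return {key: (env_value, workflow_default)} for every disagreement."""
    return {k: (env.get(k), v) for k, v in defaults.items() if env.get(k) != v}


print(mismatches(parse_env(ENV_TEXT), WORKFLOW_DEFAULTS))
```

An empty result means the two files agree; a stale `.env` entry would show up as a pair of old and new values.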
37 changes: 0 additions & 37 deletions .github/workflows/dev.yml
@@ -59,43 +59,6 @@ jobs:
continue-on-error: true
run: archery docker push ubuntu-lint

docs:
name: Sphinx and API documentations
runs-on: ubuntu-latest
if: github.event_name == 'push'
steps:
- name: Checkout Arrow
uses: actions/checkout@v2
with:
fetch-depth: 0
- name: Fetch Submodules and Tags
shell: bash
run: ci/scripts/util_checkout.sh
- name: Free Up Disk Space
shell: bash
run: ci/scripts/util_cleanup.sh
- name: Cache Docker Volumes
uses: actions/cache@v1
with:
path: .docker
key: ubuntu-18.04-${{ hashFiles('cpp/**') }}
restore-keys: ubuntu-18.04-
- name: Setup Python
uses: actions/setup-python@v1
with:
python-version: 3.8
- name: Setup Archery
run: pip install -e dev/archery[docker]
- name: Execute Docker Build
run: |
sudo sysctl -w kernel.core_pattern="core.%e.%p"
ulimit -c unlimited
archery docker run ubuntu-docs
- name: Docker Push
if: success() && github.event_name == 'push' && github.repository == 'apache/arrow'
continue-on-error: true
run: archery docker push ubuntu-docs

release:
name: Source Release and Merge Script
runs-on: ubuntu-latest
34 changes: 34 additions & 0 deletions .github/workflows/dev_labeler.yml
@@ -0,0 +1,34 @@
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.

name: PR labeler
on:
pull_request_target:
types: [opened, reopened]
paths:
- 'rust/**'

jobs:
assign-rust-labels:
runs-on: ubuntu-latest
steps:
- name: Assign Github labels
uses: actions/labeler@…
with:
repo-token: ${{ secrets.GITHUB_TOKEN }}
configuration-path: .github/workflows/dev_labeler/labeler.yml
sync-labels: true
22 changes: 22 additions & 0 deletions .github/workflows/dev_labeler/labeler.yml
@@ -0,0 +1,22 @@
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.

lang-rust:
- rust/**/*

datafusion:
- rust/datafusion/**/*
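The configuration above maps path globs to labels: anything under `rust/` gets `lang-rust`, and anything under `rust/datafusion/` also gets `datafusion`. The Python sketch below illustrates that mapping under a simplifying assumption: it approximates the globs with plain prefix tests, which is not how the labeler action itself matches paths. The function name `labels_for` and the sample paths are hypothetical.

```python
# Prefix approximation of the globs in labeler.yml (an assumption for
# illustration; the real action evaluates the glob patterns themselves).
LABEL_PREFIXES = {
    "lang-rust": "rust/",
    "datafusion": "rust/datafusion/",
}


def labels_for(paths):
    """Return the sorted labels whose prefix matches at least one path."""
    return sorted(
        label
        for label, prefix in LABEL_PREFIXES.items()
        if any(path.startswith(prefix) for path in paths)
    )


# Hypothetical changed files in a pull request.
print(labels_for(["rust/datafusion/src/lib.rs", "cpp/src/arrow/array.cc"]))
```

A PR that touches only `rust/arrow/` would receive `lang-rust` alone, and one that touches no Rust files would receive neither label.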
2 changes: 1 addition & 1 deletion .github/workflows/java.yml
@@ -83,7 +83,7 @@ jobs:
- name: Docker Push
if: success() && github.event_name == 'push' && github.repository == 'apache/arrow'
continue-on-error: true
run: archery docker push debian-go
run: archery docker push debian-java

macos:
name: AMD64 MacOS 10.15 Java JDK ${{ matrix.jdk }}
6 changes: 3 additions & 3 deletions .github/workflows/js.yml
@@ -40,7 +40,7 @@ env:
jobs:

docker:
name: AMD64 Debian 10 NodeJS 11
name: AMD64 Debian 10 NodeJS 14
runs-on: ubuntu-latest
if: ${{ !contains(github.event.pull_request.title, 'WIP') }}
steps:
@@ -75,7 +75,7 @@ jobs:
strategy:
fail-fast: false
matrix:
node: [11]
node: [14]
steps:
- name: Checkout Arrow
uses: actions/checkout@v2
@@ -103,7 +103,7 @@ jobs:
# strategy:
# fail-fast: false
# matrix:
# node: [11]
# node: [14]
# steps:
# - name: Checkout Arrow
# uses: actions/checkout@v1
17 changes: 9 additions & 8 deletions .github/workflows/python.yml
@@ -50,7 +50,7 @@ jobs:
- ubuntu-16.04-python-3
- conda-python-3.8-nopandas
- conda-python-3.6-pandas-0.23
- conda-python-3.6-pandas-latest
- conda-python-3.7-pandas-latest
- centos-python-3.6-manylinux1
include:
- name: ubuntu-16.04-python-3
@@ -70,11 +70,11 @@ jobs:
title: AMD64 Conda Python 3.6 Pandas 0.23
python: 3.6
pandas: 0.23
- name: conda-python-3.6-pandas-latest
cache: conda-python-3.6
- name: conda-python-3.7-pandas-latest
cache: conda-python-3.7
image: conda-python-pandas
title: AMD64 Conda Python 3.6 Pandas latest
python: 3.6
title: AMD64 Conda Python 3.7 Pandas latest
python: 3.7
pandas: latest
- name: centos-python-3.6-manylinux1
cache: manylinux1
@@ -116,7 +116,7 @@ jobs:
run: archery docker push ${{ matrix.image }}

macos:
name: AMD64 MacOS 10.15 Python 3.7
name: AMD64 MacOS 10.15 Python 3
runs-on: macos-latest
if: ${{ !contains(github.event.pull_request.title, 'WIP') }}
env:
@@ -150,8 +150,9 @@ jobs:
brew update --preinstall
brew bundle --file=cpp/Brewfile
brew install coreutils python
pip3 install -r python/requirements-build.txt \
-r python/requirements-test.txt
python3 -mpip install \
-r python/requirements-build.txt \
-r python/requirements-test.txt
- name: Build
shell: bash
run: |