Merged
144 commits
814c681
ARROW-10192: [Python] Always decode inner dictionaries when convertin…
pitrou Oct 6, 2020
bca04cd
ARROW-10176: [C++] Avoid using unformattable types for test parameters
bkietz Oct 6, 2020
03dea9b
ARROW-10200: [CI][Java] Fix a job failure for s390x Java on TravisCI
kiszk Oct 6, 2020
8a03003
ARROW-10201: [C++][CI] Disable S3 in arm64 job on Travis CI
kou Oct 7, 2020
e9a12fa
ARROW-10058: [C++] Improve repeated levels conversion without BMI2
pitrou Oct 7, 2020
e9a0ada
ARROW-10202: [CI][Windows] Use sf.net mirror for MSYS2
kou Oct 7, 2020
47a019b
ARROW-10178: [CI] Remove patch to fix Spark master build
BryanCutler Oct 7, 2020
aeef2a6
ARROW-9147: [C++][Dataset] Support projection from null->any type
bkietz Oct 7, 2020
1a2d048
ARROW-10151: [Python] Add support for MapArray conversion to Pandas
BryanCutler Oct 7, 2020
ef08a9d
ARROW-9943: [C++] Recursively apply Arrow metadata when reading from …
pitrou Oct 7, 2020
6a8f84d
ARROW-10205: [Java][FlightRPC] Allow disabling server validation
jduo Oct 7, 2020
8eea173
ARROW-7960: [C++] Add support for reading additional types
emkornfield Oct 7, 2020
a28b1fb
ARROW-10093: [R] Add ability to opt-out of int64 -> int demotion
romainfrancois Oct 7, 2020
eec7277
ARROW-10214: [Python] Allow printing undecodable schema metadata
pitrou Oct 7, 2020
a3b5bcd
ARROW-10217: [CI] Run fewer GitHub Actions jobs
nealrichardson Oct 7, 2020
62269e9
ARROW-10030: [Rust] Add support for `FromIter` and `IntoIter` for pri…
jorgecarleitao Oct 7, 2020
1c9c718
ARROW-10181: [Rust] Skip compiling one test on 32 bit ARM architecture
andygrove Oct 7, 2020
ad51ac2
ARROW-10196: [C++] Add Future::DeferNotOk
bkietz Oct 7, 2020
25d131b
ARROW-9645: [Python] Deprecate pyarrow.filesystem in favor of pyarrow.fs
jorisvandenbossche Oct 7, 2020
2a49df6
Fix YAML syntax in dev_labeler
nealrichardson Oct 7, 2020
0f56c60
ARROW-9964: [C++] Allow reading date types from CSV data
pitrou Oct 7, 2020
806b0ce
ARROW-6972: [C#] Support for StructArrays
Oct 7, 2020
9526558
ARROW-10204: [Rust] Filter kernel should only count bits in valid range
jhorstmann Oct 7, 2020
732e333
ARROW-9266: [Python][Packaging] Enable S3 support in macOS wheels
kszucs Oct 7, 2020
8366799
ARROW-10099: [C++][Dataset] Simplify type inference for partition col…
bkietz Oct 7, 2020
4bab70f
ARROW-10139: [C++] Add support for building arrow_testing without bui…
kou Oct 7, 2020
ae396b9
ARROW-9782: [C++][Dataset] More configurable Dataset writing
bkietz Oct 8, 2020
1150c38
ARROW-10134: [Python][Dataset] Add ParquetFileFragment.num_row_groups
bkietz Oct 8, 2020
878c534
ARROW-10227: [Ruby] Use a table size as the default for parquet chunk…
kanga333 Oct 8, 2020
d4cbc4b
ARROW-10229: [C++] Remove errant log line
emkornfield Oct 8, 2020
54199ec
ARROW-10023: [C++][Gandiva] Implement split_part function in gandiva
Oct 8, 2020
ba7ee65
ARROW-9967: [Python] Add compute module docs + expose more option cla…
arw2019 Oct 8, 2020
20f2bd4
ARROW-10040: [Rust] Iterate over and combine boolean buffers with arb…
jhorstmann Oct 8, 2020
8447bb1
ARROW-10235: [Rust][DataFusion] Improve documentation for type coercion
alamb Oct 8, 2020
7490391
ARROW-6537 [R]: Pass column_types to CSV reader
romainfrancois Oct 8, 2020
4bbb747
ARROW-10233: [Rust] Make array_value_to_string available in all Arrow…
alamb Oct 8, 2020
945f649
ARROW-9414: [Packaging][deb][RPM] Enable S3
kou Oct 8, 2020
0100121
ARROW-10015: [Rust] Simd aggregate kernels
jhorstmann Oct 8, 2020
1c7581c
ARROW-10043: [Rust][DataFusion] Implement COUNT(DISTINCT col)
drusso Oct 8, 2020
4c101ef
ARROW-10164: [Rust] Add support for DictionaryArray to cast kernel
alamb Oct 8, 2020
beb031f
ARROW-10238: [C#] List<Struct> is broken
eerhardt Oct 9, 2020
109f701
ARROW-10237: [C++] Duplicate dict values cause corrupt parquet
bkietz Oct 9, 2020
f0f7593
ARROW-10239: [C++] Add missing zlib dependency to aws-sdk-cpp
kou Oct 9, 2020
d908bc8
ARROW-9879: [Python] Add support for numpy scalars to ChunkedArray.__…
xhochy Oct 9, 2020
f2ad6a9
ARROW-9956: [C++] [Gandiva] Implementation of binary_string function …
Oct 9, 2020
599b458
ARROW-9518: [Python] Deprecate pyarrow serialization
jorisvandenbossche Oct 9, 2020
b8b4419
ARROW-8355: [Python] Remove hard pandas dependency from FeatherDatas…
arw2019 Oct 9, 2020
97a38a8
ARROW-10231: [CI] Unable to download minio in arm32v7 docker image
kszucs Oct 9, 2020
4d2cf9f
ARROW-10175: [CI] Fix nightly HDFS integration tests (ensure to use l…
jorisvandenbossche Oct 9, 2020
9e587be
ARROW-10206: [C++][Python][FlightRPC] Allow disabling server validation
jduo Oct 9, 2020
20cd08a
ARROW-10215: [Rust] [DataFusion] Renamed Source to SendableRecordBatc…
jorgecarleitao Oct 9, 2020
008578e
ARROW-10114: [R] Segfault in to_dataframe_parallel with deeply nested…
romainfrancois Oct 9, 2020
883eb57
ARROW-9870: [R] Friendly interface for filesystems (S3)
nealrichardson Oct 9, 2020
3694794
ARROW-8296: [C++][Dataset] Add IpcFileWriteOptions
bkietz Oct 9, 2020
a6cbffe
ARROW-10257: [R] Prepare news/docs for 2.0 release
nealrichardson Oct 10, 2020
f405075
ARROW-9962: [Python] Fix conversion to_pandas with tz-aware index col…
jorisvandenbossche Oct 10, 2020
cf1b9f6
ARROW-10244: [Python] Document pyarrow.dataset.parquet_dataset
jorisvandenbossche Oct 10, 2020
9a8bd30
ARROW-10248: [Python][Dataset] Always apply Python's default write pr…
bkietz Oct 10, 2020
4679c7c
ARROW-3080: [Python] Unify Arrow to Python object conversion paths
kszucs Oct 10, 2020
ad45ed5
ARROW-10230: [JS][Doc] JavaScript documentation fails to build
kszucs Oct 10, 2020
d8af4fa
ARROW-10252: [Python] Add option to skip inclusion of Arrow headers i…
xhochy Oct 10, 2020
a86be66
ARROW-10260: [Python] Missing MapType in to_pandas_dtype()
dmarsh19 Oct 10, 2020
cd2179f
ARROW-10256: [C++][Flight] Disable -Werror carefully
kou Oct 10, 2020
5324d98
ARROW-9952: [Python] Optionally use pyarrow.dataset in parquet.write_…
jorisvandenbossche Oct 10, 2020
0a105a3
ARROW-10265: [CI] Use smaller build when cache doesn't exist on Travi…
kou Oct 10, 2020
9d90eb4
ARROW-10266: [CI][macOS] Ensure using Python 3.8 with Homebrew
kou Oct 10, 2020
4a6735a
ARROW-10267: [Python] Skip flight test if disable_server_verification…
kszucs Oct 10, 2020
79c83b2
ARROW-10240: [Rust] Optionally load data into memory before running b…
jhorstmann Oct 10, 2020
0f06c99
ARROW-7957: [Python] Handle new FileSystem in ParquetDataset by autom…
jorisvandenbossche Oct 10, 2020
8178b01
ARROW-10100: [C++][Python][Dataset] Add ParquetFileFragment::Subset m…
jorisvandenbossche Oct 10, 2020
c25fd66
ARROW-9553: [Rust] Release script doesn't bump parquet crate's arrow …
kszucs Oct 10, 2020
ba0f7f1
ARROW-10272: [Packaging][Python] Pin newer multibuild version to avoi…
kszucs Oct 10, 2020
6428aa5
ARROW-10251: [Rust] [DataFusion] MemTable::load() now loads partition…
andygrove Oct 11, 2020
dcc86f4
ARROW-10271: [Rust] Update dependencies
nevi-me Oct 11, 2020
8bbfdc4
ARROW-10234: [C++][Gandiva] Fix logic of round() for floats/decimals …
sgnkc Oct 12, 2020
0b48152
ARROW-10262: [C++] Fix TypeClass for BinaryScalar and LargeBinaryScalar
zanmato1984 Oct 12, 2020
330396a
ARROW-10286: [C++][FlightRPC] Make CMake output less confusing
lidavidm Oct 12, 2020
493d2c6
ARROW-10288: [C++] Fix compilation errors on 32-bit x86
pitrou Oct 12, 2020
e9bb109
ARROW-10287: [C++] Avoid std::random_device
pitrou Oct 12, 2020
e6eb61f
ARROW-10281: [Python] Fix warnings when running tests
jorisvandenbossche Oct 12, 2020
0419c26
ARROW-10284: [Python] Correctly suppress warning about legacy filesys…
jorisvandenbossche Oct 12, 2020
12281c4
ARROW-10285: [Python] Fix usage of deprecated num_children in pyarrow…
jorisvandenbossche Oct 12, 2020
d8f8d46
ARROW-10280: [Packaging][Python] Fix macOS wheel artifact patterns
kszucs Oct 12, 2020
da89ad1
ARROW-10273: [CI][Homebrew] Fix "brew audit" usage
kou Oct 12, 2020
6ce2793
ARROW-10279: [Release][Python] Fix verification script to align with …
kszucs Oct 12, 2020
70ae161
ARROW-10290: [C++] List POP_BACK is not available in older CMake vers…
kszucs Oct 12, 2020
0aa2069
[Release] Update CHANGELOG.md for 2.0.0
kszucs Oct 12, 2020
e46a3c6
[Release] Update .deb/.rpm changelogs for 2.0.0
kszucs Oct 12, 2020
5943421
[Release] Update versions for 2.0.0
kszucs Oct 12, 2020
4782866
[maven-release-plugin] prepare release apache-arrow-2.0.0
kszucs Oct 12, 2020
b1f36ac
[Release] Update versions for 3.0.0-SNAPSHOT
kszucs Oct 19, 2020
f72575c
[Release] Update .deb package names for 3.0.0
kszucs Oct 19, 2020
a3a35b2
ARROW-10263: [C++][Compute] Improve variance kernel numerical stability
cyb70289 Oct 13, 2020
91b5f07
ARROW-10293: [Rust] [DataFusion] Fixed benchmarks
jorgecarleitao Oct 13, 2020
a030fc5
ARROW-10295 [Rust] [DataFusion] Replace Rc<RefCell<>> by Box<> in acc…
jorgecarleitao Oct 13, 2020
34533b6
ARROW-10289: [Rust] Read dictionaries in IPC streams
nevi-me Oct 14, 2020
7209ffc
ARROW-10292: [Rust] [DataFusion] Simplify merge
jorgecarleitao Oct 14, 2020
9e671ac
ARROW-10270: [R] Fix CSV timestamp_parsers test on R-devel
nealrichardson Oct 14, 2020
8f302d3
ARROW-9479: [JS] Fix Table.from for zero-item serialized tables, Tabl…
H-Plus-Time Oct 15, 2020
03c7c02
ARROW-10145: [C++][Dataset] Assert integer overflow in partitioning f…
bkietz Oct 15, 2020
a7ef5d2
ARROW-10174: [Java] Fix reading/writing dict structs
HedgehogCode Oct 16, 2020
3f69ad2
ARROW-10236: [Rust] Add can_cast_types to arrow cast kernel, use in D…
alamb Oct 16, 2020
22027c7
ARROW-10294: [Java] Resolve problems of DecimalVector APIs on ArrowBufs
liyafan82 Oct 16, 2020
cb58146
ARROW-9475: [Java] Clean up usages of BaseAllocator, use BufferAlloca…
zhztheplayer Oct 16, 2020
3f96cc0
ARROW-10313: [C++] Faster UTF8 validation for small strings
pitrou Oct 16, 2020
36bf7a4
ARROW-9898: [C++][Gandiva] Fix linking issue with castINT/FLOAT funct…
projjal Oct 16, 2020
2fe8b86
ARROW-10311: [Release] Update crossbow verification process
kszucs Oct 16, 2020
3dbea74
ARROW-10321: [C++] Use check_cxx_source_compiles for AVX512 detect in…
frankdjx Oct 16, 2020
be21209
ARROW-5350: [Rust] Allow filtering on simple lists
nevi-me Oct 18, 2020
29e9d13
ARROW-10002: [Rust] Remove trait specialization from arrow crate
jorgecarleitao Oct 18, 2020
ad2a002
ARROW-10163: [Rust] [DataFusion] Add DictionaryArray coercion support
alamb Oct 18, 2020
b653922
ARROW-10323: [Release][wheel] Add missing verification setup step
kou Oct 19, 2020
5af12b0
ARROW-10340: [Packaging][deb][RPM] Use Python 3.8 for pygit2
kou Oct 19, 2020
6ad24c2
ARROW-10310: [C++][Gandiva] Add single argument round() in Gandiva
sgnkc Oct 19, 2020
139fb51
ARROW-10241: [C++][Compute] Add variance kernel benchmark
cyb70289 Oct 19, 2020
457935e
ARROW-10106: [FlightRPC][Java] Expose onIsReady() callback
jduo Oct 19, 2020
7944265
ARROW-10203: [Doc] Give guidance on big-endian support in the contrib…
emkornfield Oct 19, 2020
0e13e28
ARROW-9164: [C++] Add embedded documentation to compute functions
pitrou Oct 19, 2020
445251a
ARROW-9991: [C++] Split kernels for strings/binary
maartenbreddels Oct 19, 2020
04660f8
ARROW-10348: [C++] Fix crash on invalid Parquet data
pitrou Oct 19, 2020
7696b89
ARROW-10338: [Rust] Use const fn for applicable methods
vertexclique Oct 20, 2020
2321542
ARROW-10318: [C++] Use pimpl idiom in CSV parser
pitrou Oct 20, 2020
84e4b15
ARROW-10320 [Rust] [DataFusion] Migrated from batch iterators to batc…
jorgecarleitao Oct 20, 2020
760284f
ARROW-10358: [R] Followups to 2.0.0 release
nealrichardson Oct 20, 2020
f06c68a
ARROW-10331: [Rust] [DataFusion] Re-organize DataFusion errors
jorgecarleitao Oct 21, 2020
bb4f2a0
ARROW-9963: [Python] Recognize datetime.timezone.utc as UTC on conver…
jorisvandenbossche Oct 21, 2020
16412a1
ARROW-10328: [C++] Vendor fast_float number parsing library
pitrou Oct 21, 2020
625bf3f
ARROW-10363: [Python] Remove CMake bug workaround in manylinux
pitrou Oct 22, 2020
e2d8dc3
ARROW-10304: [C++][Compute] Optimize variance kernel for integers
cyb70289 Oct 22, 2020
5ad3c98
ARROW-10369: [Dev] Fix archery release utility test cases
kszucs Oct 22, 2020
843e8bb
ARROW-9304: [C++] Add "AppendEmpty" builder APIs for use inside Struc…
tianchen92 Oct 22, 2020
103489b
ARROW-10208: [C++] Fix split string kernels on sliced input
pitrou Oct 22, 2020
39a7539
ARROW-10207: [C++] Allow precomputing output string/list offsets in k…
pitrou Oct 22, 2020
982ca83
ARROW-10365: [R] Remove duplicate setting of S3 flag on macOS
nealrichardson Oct 22, 2020
9cb0e9f
ARROW-10249: [Rust] Support nested dictionaries inside list arrays
vertexclique Oct 23, 2020
b3992ea
ARROW-10333: [Java] Get rid of org.apache.arrow.util in vector
HedgehogCode Oct 23, 2020
590541d
ARROW-9747: [Java][C++] Initial Support for 256-bit Decimals
MingyuZhong Oct 23, 2020
c49b5da
ARROW-10302: [Python] Don't double-package plasma-store-server
xhochy Oct 23, 2020
eea8d35
ARROW-10375 [Rust] Removed PrimitiveArrayOps
jorgecarleitao Oct 23, 2020
da50838
ARROW-10383: [Doc] fix typos
kiszk Oct 24, 2020
aa2e6d4
ARROW-10384: [C++] Fix typos
kiszk Oct 24, 2020
221de70
ARROW-10382: [Rust] Fix typos
kiszk Oct 24, 2020
a764d3b
ARROW-10332: [Rust] Allow CSV reader to iterate from start up to end
jorgecarleitao Oct 25, 2020
7155cd5
ARROW-10135: [Rust] [Parquet] Refactor file module to help adding sou…
rdettai Oct 25, 2020
7 changes: 5 additions & 2 deletions .github/workflows/cpp.yml
@@ -102,7 +102,9 @@ jobs:
  # hosted machines
  name: ${{ matrix.title }}
  runs-on: ${{ matrix.runner }}
- if: github.event_name == 'push'
+ # TODO(kszucs): re-enable once the self-hosted workers are properly
+ # registered to github
+ if: false && github.event_name == 'push'
  defaults:
  # To use certain environment variables set by .bashrc, an interactive
  # bash shell must be used
@@ -119,11 +121,12 @@ jobs:
  debian: 10
  title: ARM32v7 Debian 10 C++
  image: |
- -e CPP_MAKE_PARALLELISM=4 \
+ -e CPP_MAKE_PARALLELISM=2 \
  -e CXXFLAGS=-Wno-psabi \
  -e ARROW_PARQUET=OFF \
  -e ARROW_FLIGHT=OFF \
  -e ARROW_GANDIVA=OFF \
+ -e ARROW_ORC=OFF \
  -e CMAKE_ARGS=-DARROW_CPU_FLAG=armv7 \
  debian-cpp
  arch: 'arm32v7'
5 changes: 4 additions & 1 deletion .github/workflows/dev_labeler.yml
@@ -17,7 +17,10 @@

  name: PR labeler
  on:
- - pull_request_target
+ pull_request_target:
+ types: [opened, reopened]
+ paths:
+ - 'rust/**'

  jobs:
  assign-rust-labels:
2 changes: 1 addition & 1 deletion .github/workflows/dev_labeler/labeler.yml
@@ -18,5 +18,5 @@
  lang-rust:
  - rust/**/*

- datafusion:
+ datafusion:
  - rust/datafusion/**/*
7 changes: 4 additions & 3 deletions .github/workflows/python.yml
@@ -116,7 +116,7 @@ jobs:
  run: archery docker push ${{ matrix.image }}

  macos:
- name: AMD64 MacOS 10.15 Python 3.7
+ name: AMD64 MacOS 10.15 Python 3
  runs-on: macos-latest
  if: ${{ !contains(github.event.pull_request.title, 'WIP') }}
  env:
@@ -150,8 +150,9 @@ jobs:
  brew update --preinstall
  brew bundle --file=cpp/Brewfile
  brew install coreutils python
- pip3 install -r python/requirements-build.txt \
- -r python/requirements-test.txt
+ python3 -mpip install \
+ -r python/requirements-build.txt \
+ -r python/requirements-test.txt
  - name: Build
  shell: bash
  run: |
23 changes: 11 additions & 12 deletions .github/workflows/r.yml
@@ -53,7 +53,7 @@ jobs:
  strategy:
  fail-fast: false
  matrix:
- r: ["3.6", "4.0"]
+ r: ["3.6"]
  ubuntu: [18.04]
  env:
  R: ${{ matrix.r }}
@@ -92,21 +92,20 @@ jobs:
  continue-on-error: true
  run: archery docker push ubuntu-r

- rstudio:
- name: "rstudio/r-base:${{ matrix.r_version }}-${{ matrix.r_image }}"
+ bundled:
+ name: "${{ matrix.config.org }}/${{ matrix.config.image }}:${{ matrix.config.tag }}"
  runs-on: ubuntu-latest
  if: ${{ !contains(github.event.pull_request.title, 'WIP') }}
  strategy:
  fail-fast: false
  matrix:
- # See https://hub.docker.com/r/rstudio/r-base
- r_version: ["3.6", "4.0"]
- r_image:
- - centos7
+ config:
+ - {org: 'rstudio', image: 'r-base', tag: '4.0-centos7'}
+ - {org: 'rhub', image: 'debian-gcc-devel', tag: 'latest'}
  env:
- R_ORG: rstudio
- R_IMAGE: r-base
- R_TAG: ${{ matrix.r_version }}-${{ matrix.r_image }}
+ R_ORG: ${{ matrix.config.org }}
+ R_IMAGE: ${{ matrix.config.image }}
+ R_TAG: ${{ matrix.config.tag }}
  steps:
  - name: Checkout Arrow
  uses: actions/checkout@v2
@@ -120,8 +119,8 @@ jobs:
  uses: actions/cache@v1
  with:
  path: .docker
- key: ${{ matrix.r_image }}-r-${{ hashFiles('cpp/**') }}
- restore-keys: ${{ matrix.r_image }}-r-
+ key: ${{ matrix.config.image }}-r-${{ hashFiles('cpp/**') }}
+ restore-keys: ${{ matrix.config.image }}-r-
  - name: Setup Python
  uses: actions/setup-python@v1
  with:
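
Note on the matrix rewrite in this file: each `config` entry carries `org`, `image`, and `tag`, and the job name joins them as `org/image:tag`. A minimal shell sketch of that composition follows. The `configs` and `image_refs` variables are hypothetical stand-ins for the matrix and are not part of the workflow.

```shell
# Hypothetical stand-in for the workflow matrix: one "org image tag" triple per line.
configs="rstudio r-base 4.0-centos7
rhub debian-gcc-devel latest"

# Join each triple the same way the job name does: org/image:tag.
image_refs=""
while read -r org image tag; do
  ref="${org}/${image}:${tag}"
  echo "$ref"
  # Collect the references in one space-separated string.
  image_refs="${image_refs:+$image_refs }$ref"
done <<EOF
$configs
EOF
```

Because the cache key is now built from `matrix.config.image` alone, the two entries restore from distinct prefixes, `r-base-r-` and `debian-gcc-devel-r-`.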
20 changes: 20 additions & 0 deletions .travis.yml
@@ -42,13 +42,29 @@ jobs:
  DOCKER_IMAGE_ID: ubuntu-cpp
  # ARROW_USE_GLOG=OFF is needed to avoid build error caused by
  # glog and CMAKE_UNITY_BUILD=ON.
+ #
+ # Disable ARROW_S3 because it often causes "No output has
+ # been received in the last 10m0s, this potentially indicates
+ # a stalled build or something wrong with the build itself."
+ # on Travis CI.
  DOCKER_RUN_ARGS: >-
  "
  -e ARROW_BUILD_STATIC=OFF
  -e ARROW_ORC=OFF
+ -e ARROW_S3=OFF
  -e ARROW_USE_GLOG=OFF
  -e CMAKE_UNITY_BUILD=ON
  "
+ # We need to use smaller build when cache doesn't exist
+ # because Travis CI has "No output has been received in the
+ # last 10m0s" limitation. If we build many modules, we reach
+ # the limitation.
+ DOCKER_RUN_ARGS_NO_CACHE: >-
+ "
+ -e ARROW_BUILD_TESTS=OFF
+ -e ARROW_GANDIVA=OFF
+ -e ARROW_PARQUET=OFF
+ "
  UBUNTU: "20.04"

  - name: "C++ on s390x"
@@ -124,6 +140,10 @@ script:
  # /home/travis/.travis/functions: line 109: ulimit: core file size: cannot modify limit: Operation not permitted
  - |
  ulimit -c unlimited || :
+ - |
+ if [ $(ls $TRAVIS_BUILD_DIR/.docker | wc -l) -eq 0 ]; then
+ DOCKER_RUN_ARGS="${DOCKER_RUN_ARGS} ${DOCKER_RUN_ARGS_NO_CACHE}"
+ fi
  - |
  archery docker run \
  ${DOCKER_RUN_ARGS} \
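
The cache check added to `script` can be tried outside Travis. The sketch below makes some assumptions: `select_docker_run_args` is a hypothetical helper that does not exist in `.travis.yml`, and a temporary directory stands in for `$TRAVIS_BUILD_DIR/.docker`. It uses the same `ls | wc -l` test and silences the `ls` error so that a missing directory also counts as an empty cache.

```shell
# Hypothetical helper: prints the docker run arguments to use.
#   $1 = cache directory, $2 = base arguments, $3 = extra arguments for a cold cache
select_docker_run_args() {
  # An empty (or missing) cache directory means no cached layers are available,
  # so the extra arguments that shrink the build are appended.
  if [ "$(ls "$1" 2>/dev/null | wc -l | tr -d ' ')" -eq 0 ]; then
    echo "$2 $3"
  else
    echo "$2"
  fi
}

# Temporary directory standing in for $TRAVIS_BUILD_DIR/.docker.
demo_dir="$(mktemp -d)"

# Cold cache: the directory is empty, so both argument sets are used.
cold="$(select_docker_run_args "$demo_dir" "-e ARROW_S3=OFF" "-e ARROW_PARQUET=OFF")"

# Warm cache: once any file exists, only the base arguments are used.
touch "$demo_dir/layer.tar"
warm="$(select_docker_run_args "$demo_dir" "-e ARROW_S3=OFF" "-e ARROW_PARQUET=OFF")"

echo "cold: $cold"
echo "warm: $warm"
rm -rf "$demo_dir"
```

The design choice in the diff is that the smaller build only applies on a cold cache, so the full set of modules is still built once cached layers bring the job back under the 10-minute no-output limit.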