|
12 | 12 | limitations under the License. See accompanying LICENSE file. |
13 | 13 | --> |
14 | 14 |
|
| 15 | +# Apache Arrow 0.5.0 (23 July 2017) |
| 16 | + |
| 17 | +## Bug |
| 18 | + |
| 19 | +* ARROW-1074 - from_pandas doesn't convert ndarray to list |
| 20 | +* ARROW-1079 - [Python] Empty "private" directories should be ignored by Parquet interface |
| 21 | +* ARROW-1081 - C++: arrow::test::TestBase::MakePrimitive doesn't fill null_bitmap |
| 22 | +* ARROW-1096 - [C++] Memory mapping file over 4GB fails on Windows |
| 23 | +* ARROW-1097 - Reading tensor needs file to be opened in writeable mode |
| 24 | +* ARROW-1098 - Document Error? |
| 25 | +* ARROW-1101 - UnionListWriter is not implementing all methods on interface ScalarWriter |
| 26 | +* ARROW-1103 - [Python] Utilize pandas metadata from common _metadata Parquet file if it exists |
| 27 | +* ARROW-1107 - [JAVA] NullableMapVector getField() should return nullable type |
| 28 | +* ARROW-1108 - Check if ArrowBuf is empty buffer in getActualConsumedMemory() and getPossibleConsumedMemory() |
| 29 | +* ARROW-1109 - [JAVA] transferOwnership fails when readerIndex is not 0 |
| 30 | +* ARROW-1110 - [JAVA] make union vector naming consistent |
| 31 | +* ARROW-1111 - [JAVA] Make aligning buffers optional, and allow -1 for unknown null count |
| 32 | +* ARROW-1112 - [JAVA] Set lastSet for VarLength and List vectors when loading |
| 33 | +* ARROW-1113 - [C++] gflags EP build gets triggered (as a no-op) on subsequent calls to make or ninja build |
| 34 | +* ARROW-1115 - [C++] Use absolute path for ccache |
| 35 | +* ARROW-1117 - [Docs] Minor issues in GLib README |
| 36 | +* ARROW-1124 - [Python] pyarrow needs to depend on numpy>=1.10 (not 1.9) |
| 37 | +* ARROW-1125 - Python: Table.from_pandas doesn't work anymore on partial schemas |
| 38 | +* ARROW-1128 - [Docs] command to build a wheel is not properly rendered |
| 39 | +* ARROW-1129 - [C++] Fix Linux toolchain build regression from ARROW-742 |
| 40 | +* ARROW-1131 - Python: Parquet unit tests are always skipped |
| 41 | +* ARROW-1132 - [Python] Unable to write pandas DataFrame w/MultiIndex containing duplicate values to parquet |
| 42 | +* ARROW-1136 - [C++/Python] Segfault on empty stream |
| 43 | +* ARROW-1138 - Travis: Use OpenJDK7 instead of OracleJDK7 |
| 44 | +* ARROW-1139 - [C++] dlmalloc doesn't allow arrow to be built with clang 4 or gcc 7.1.1 |
| 45 | +* ARROW-1141 - on import get libjemalloc.so.2: cannot allocate memory in static TLS block |
| 46 | +* ARROW-1143 - C++: Fix comparison of NullArray |
| 47 | +* ARROW-1144 - [C++] Remove unused variable |
| 48 | +* ARROW-1150 - [C++] AdaptiveIntBuilder compiler warning on MSVC |
| 49 | +* ARROW-1152 - [Cython] read_tensor should work with a readable file |
| 50 | +* ARROW-1155 - Segmentation fault when running pa.Int16Value() |
| 51 | +* ARROW-1157 - C++/Python: Decimal templates are not correctly exported on OSX |
| 52 | +* ARROW-1159 - [C++] Static data members cannot be accessed from inline functions in Arrow headers by thirdparty users |
| 53 | +* ARROW-1162 - Transfer Between Empty Lists Should Not Invoke Callback |
| 54 | +* ARROW-1166 - Errors in Struct type's example and missing reference in Layout.md |
| 55 | +* ARROW-1167 - [Python] Create chunked BinaryArray in Table.from_pandas when a column's data exceeds 2GB |
| 56 | +* ARROW-1168 - [Python] pandas metadata may contain "mixed" data types |
| 57 | +* ARROW-1169 - C++: jemalloc externalproject doesn't build with CMake's ninja generator |
| 58 | +* ARROW-1170 - C++: ARROW_JEMALLOC=OFF breaks linking on unittest |
| 59 | +* ARROW-1174 - [GLib] Investigate root cause of ListArray glib test failure |
| 60 | +* ARROW-1177 - [C++] Detect int32 overflow in ListBuilder::Append |
| 61 | +* ARROW-1179 - C++: Add missing virtual destructors |
| 62 | +* ARROW-1180 - [GLib] garrow_tensor_get_dimension_name() returns invalid address |
| 63 | +* ARROW-1181 - [Python] Parquet test fail if not enabled |
| 64 | +* ARROW-1182 - C++: Specify BUILD_BYPRODUCTS for zlib and zstd |
| 65 | +* ARROW-1186 - [C++] Enable option to build arrow with minimal dependencies needed to build Parquet library |
| 66 | +* ARROW-1188 - Segfault when trying to serialize a DataFrame with Null-only Categorical Column |
| 67 | +* ARROW-1190 - VectorLoader corrupts vectors with duplicate names |
| 68 | +* ARROW-1191 - [JAVA] Implement getField() method for the complex readers |
| 69 | +* ARROW-1194 - Getting record batch size with pa.get_record_batch_size returns a size that is too small for pandas DataFrame. |
| 70 | +* ARROW-1197 - [GLib] record_batch.hpp Inclusion is missing |
| 71 | +* ARROW-1200 - [C++] DictionaryBuilder should use signed integers for indices |
| 72 | +* ARROW-1201 - [Python] Incomplete Python types cause a core dump when repr-ing |
| 73 | +* ARROW-1203 - [C++] Disallow BinaryBuilder to append byte strings larger than the maximum value of int32_t |
| 74 | +* ARROW-1205 - C++: Reference to type objects in ArrayLoader may cause segmentation faults. |
| 75 | +* ARROW-1206 - [C++] Enable MSVC builds to work with some compression library support disabled |
| 76 | +* ARROW-1208 - [C++] Toolchain build with ZSTD library from conda-forge failure |
| 77 | +* ARROW-1215 - [Python] Class methods in API reference |
| 78 | +* ARROW-1216 - Numpy arrays cannot be created from Arrow Buffers on Python 2 |
| 79 | +* ARROW-1218 - Arrow doesn't compile if all compression libraries are deactivated |
| 80 | +* ARROW-1222 - [Python] pyarrow.array returns NullArray for array of unsupported Python objects |
| 81 | +* ARROW-1223 - [GLib] Fix function name that returns wrapped object |
| 82 | +* ARROW-1235 - [C++] macOS linker failure with operator<< and std::ostream |
| 83 | +* ARROW-1236 - Library paths in exported pkg-config file are incorrect |
| 84 | +* ARROW-601 - Some logical types not supported when loading Parquet |
| 85 | +* ARROW-784 - Cleaning up thirdparty toolchain support in Arrow on Windows |
| 86 | +* ARROW-992 - [Python] In place development builds do not have a __version__ |
| 87 | + |
| 88 | +## Improvement |
| 89 | + |
| 90 | +* ARROW-1041 - [Python] Support read_pandas on a directory of Parquet files |
| 91 | +* ARROW-1100 - [Python] Add "mode" property to NativeFile instances |
| 92 | +* ARROW-1102 - Make MessageSerializer.serializeMessage() public |
| 93 | +* ARROW-1120 - [Python] Write support for int96 |
| 94 | +* ARROW-1137 - Python: Ensure Pandas roundtrip of all-None column |
| 95 | +* ARROW-1148 - [C++] Raise minimum CMake version to 3.2 |
| 96 | +* ARROW-1151 - [C++] Add gcc branch prediction to status check macro |
| 97 | +* ARROW-1160 - C++: Implement DictionaryBuilder |
| 98 | +* ARROW-1165 - [C++] Refactor PythonDecimalToArrowDecimal to not use templates |
| 99 | +* ARROW-1185 - [C++] Clean up arrow::Status implementation, add warn_unused_result attribute for clang |
| 100 | +* ARROW-1187 - Serialize a DataFrame with None column |
| 101 | +* ARROW-1193 - [C++] Support pkg-config for arrow_python.so |
| 102 | +* ARROW-1196 - [C++] Appveyor separate jobs for Debug/Release builds from sources; Build with conda toolchain; Build with NMake Makefiles Generator |
| 103 | +* ARROW-1199 - [C++] Introduce mutable POD struct for generic array data |
| 104 | +* ARROW-1202 - Remove semicolons from status macros |
| 105 | +* ARROW-1217 - [GLib] Add GInputStream based arrow::io::RandomAccessFile |
| 106 | +* ARROW-1220 - [C++] Standardize usage of *_HOME cmake script variables for 3rd party libs |
| 107 | +* ARROW-1221 - [C++] Pin clang-format version |
| 108 | +* ARROW-1229 - [GLib] Follow Reader API change (get -> read) |
| 109 | +* ARROW-742 - Handling exceptions during execution of std::wstring_convert |
| 110 | +* ARROW-834 - [Python] Support creating Arrow arrays from Python iterables |
| 111 | +* ARROW-915 - Struct Array reads limited support |
| 112 | +* ARROW-935 - [Java] Build Javadoc in Travis CI |
| 113 | +* ARROW-960 - [Python] Add source build guide for macOS + Homebrew |
| 114 | +* ARROW-962 - [Python] Add schema attribute to FileReader |
| 115 | +* ARROW-966 - [Python] pyarrow.list_ should also accept Field instance |
| 116 | +* ARROW-978 - [Python] Use sphinx-bootstrap-theme for Sphinx documentation |
| 117 | + |
| 118 | +## New Feature |
| 119 | + |
| 120 | +* ARROW-1048 - Allow user LD_LIBRARY_PATH to be used with source release script |
| 121 | +* ARROW-1073 - C++: Adaptive integer builder |
| 122 | +* ARROW-1095 - [Website] Add Arrow icon asset |
| 123 | +* ARROW-111 - [C++] Add static analyzer to tool chain to verify checking of Status returns |
| 124 | +* ARROW-1122 - [Website] Guest blog post on Arrow + ODBC from turbodbc |
| 125 | +* ARROW-1123 - C++: Make jemalloc the default allocator |
| 126 | +* ARROW-1135 - Upgrade Travis CI clang builds to use LLVM 4.0 |
| 127 | +* ARROW-1142 - [C++] Move over compression library toolchain from parquet-cpp |
| 128 | +* ARROW-1145 - [GLib] Add get_values() |
| 129 | +* ARROW-1154 - [C++] Migrate more computational utility code from parquet-cpp |
| 130 | +* ARROW-1183 - [Python] Implement time type conversions in to_pandas |
| 131 | +* ARROW-1198 - Python: Add public C++ API to unwrap PyArrow object |
| 132 | +* ARROW-1212 - [GLib] Add garrow_binary_array_get_offsets_buffer() |
| 133 | +* ARROW-1214 - [Python] Add classes / functions to enable stream message components to be handled outside of the stream reader class |
| 134 | +* ARROW-1227 - [GLib] Support GOutputStream |
| 135 | +* ARROW-460 - [C++] Implement JSON round trip for DictionaryArray |
| 136 | +* ARROW-462 - [C++] Implement in-memory conversions between non-nested primitive types and DictionaryArray equivalent |
| 137 | +* ARROW-575 - Python: Auto-detect nested lists and nested numpy arrays in Pandas |
| 138 | +* ARROW-597 - [Python] Add convenience function to yield DataFrame from any object that a StreamReader or FileReader can read from |
| 139 | +* ARROW-599 - [C++] Add LZ4 codec to 3rd-party toolchain |
| 140 | +* ARROW-600 - [C++] Add ZSTD codec to 3rd-party toolchain |
| 141 | +* ARROW-692 - Java<->C++ Integration tests for dictionary-encoded vectors |
| 142 | +* ARROW-693 - [Java] Add JSON support for dictionary vectors |
| 143 | + |
| 144 | +## Task |
| 145 | + |
| 146 | +* ARROW-1052 - Arrow 0.5.0 release |
| 147 | + |
| 148 | +## Test |
| 149 | + |
| 150 | +* ARROW-1228 - [GLib] Test file name should be the same name as target class |
| 151 | +* ARROW-1233 - [C++] Validate cmake script resolving of 3rd party linked libs from correct location in toolchain build |
| 152 | + |
15 | 153 | # Apache Arrow 0.4.1 (9 June 2017) |
16 | 154 |
17 | 155 | ## Bug |