Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
1387 commits
Select commit Hold shift + click to select a range
f5f05d5
[SPARK-30310][CORE] Resolve missing match case in SparkUncaughtExcept…
tinhto-000 Jan 17, 2020
fd308ad
[SPARK-30041][SQL][WEBUI] Add Codegen Stage Id to Stage DAG visualiza…
LucaCanali Jan 17, 2020
830e635
[SPARK-27868][CORE][FOLLOWUP] Recover the default value to -1 again
xCASx Jan 17, 2020
abf759a
[SPARK-29876][SS] Delete/archive file source completed files in separ…
gaborgsomogyi Jan 17, 2020
fdbded3
[SPARK-30312][DOCS][FOLLOWUP] Add a migration guide
dongjoon-hyun Jan 17, 2020
96a3445
[SPARK-25993][SQL][TESTS] Add test cases for CREATE EXTERNAL TABLE wi…
kevinyu98 Jan 18, 2020
505693c
[SPARK-28152][DOCS][FOLLOWUP] Add a migration guide for MsSQLServer J…
dongjoon-hyun Jan 18, 2020
3228732
[SPARK-30533][ML][PYSPARK] Add classes to represent Java Regressors a…
zero323 Jan 18, 2020
a3357df
[SPARK-30544][BUILD] Upgrade the version of Genjavadoc to 0.15
sarutak Jan 18, 2020
a6bdea3
[SPARK-30539][PYTHON][SQL] Add DataFrame.tail in PySpark
HyukjinKwon Jan 18, 2020
ef1af43
[MINOR][DOCS] Remove note about -T for parallel build
srowen Jan 18, 2020
789a4ab
[MINOR][HIVE] Pick up HIVE-22708 HTTP transport fix
srowen Jan 18, 2020
0d99d7e
[SPARK-30524] [SQL] follow up SPARK-30524 to resolve comments
Jan 19, 2020
17857f9
[SPARK-30551][SQL] Disable comparison for interval type
yaooqinn Jan 19, 2020
d4c6ec6
[SPARK-30530][SQL] Fix filter pushdown for bad CSV records
MaxGekk Jan 19, 2020
f14061c
[SPARK-30371][K8S] Add spark.kubernetes.driver.master conf
wackxu Jan 19, 2020
19a1059
[SPARK-30282][DOCS][FOLLOWUP] Update SQL migration guide for SHOW TBL…
imback82 Jan 19, 2020
3858e94
[SPARK-30566][BUILD] Iterator doesn't refer outer identifier named "i…
sarutak Jan 20, 2020
c992716
[SPARK-30572][BUILD] Add a fallback Maven repository
dongjoon-hyun Jan 20, 2020
a2081ae
[SPARK-29290][CORE] Update to chill 0.9.5
srowen Jan 20, 2020
775fae4
[SPARK-30486][BUILD] Bump lz4-java version to 1.7.1
maropu Jan 20, 2020
d50f8df
[SPARK-30413][SQL] Avoid WrappedArray roundtrip in GenericArrayData c…
JoshRosen Jan 20, 2020
4806cc5
[SPARK-30547][SQL] Add unstable annotation to the CalendarInterval class
yaooqinn Jan 20, 2020
00039cc
[SPARK-30554][SQL] Return `Iterable` from `FailureSafeParser.rawParser`
MaxGekk Jan 20, 2020
ab04899
[SPARK-30558][SQL] Avoid rebuilding `AvroOptions` per each partition
MaxGekk Jan 20, 2020
b5cb9ab
[SPARK-30535][SQL] Migrate ALTER TABLE commands to the new framework
imback82 Jan 20, 2020
f5b345c
[SPARK-30578][SQL][TEST] Explicitly set conf to use DSv2 for orc in O…
Ngone51 Jan 20, 2020
fd69533
[SPARK-30482][CORE][SQL][TESTS][FOLLOW-UP] Output caller info in log …
MaxGekk Jan 21, 2020
14bc2a2
[SPARK-30530][SQL][FOLLOW-UP] Remove unnecessary codes and fix commen…
HyukjinKwon Jan 21, 2020
24efa43
[SPARK-30019][SQL] Add the owner property to v2 table
yaooqinn Jan 21, 2020
0388b7a
[SPARK-30568][SQL] Invalidate interval type as a field table schema
yaooqinn Jan 21, 2020
94284c8
[SPARK-30587][SQL][TESTS] Add test suites for CSV and JSON v1
MaxGekk Jan 21, 2020
2d59ca4
[SPARK-30475][SQL] File source V2: Push data filters for file listing
guykhazma Jan 21, 2020
78df532
[SPARK-30433][SQL][FOLLOW-UP] Optimize collect conflict plans
Ngone51 Jan 21, 2020
595cdb0
[SPARK-30571][CORE] fix splitting shuffle fetch requests
cloud-fan Jan 21, 2020
a94a4fc
[MINOR][DOCS] Fix Jenkins build image and link in README.md
HyukjinKwon Jan 21, 2020
e170422
Revert "[SPARK-30534][INFRA] Use mvn in `dev/scalastyle`"
HyukjinKwon Jan 21, 2020
730388b
[SPARK-30547][SQL][FOLLOWUP] Update since anotation for CalendarInter…
yaooqinn Jan 21, 2020
af70542
[SPARK-30593][SQL] Revert interval ISO/ANSI SQL Standard output since…
yaooqinn Jan 21, 2020
ff39c92
[SPARK-30252][SQL] Disallow negative scale of Decimal
Ngone51 Jan 21, 2020
cfb1706
[SPARK-15616][SQL] Add optimizer rule PruneHiveTablePartitions
fuwhu Jan 21, 2020
a131031
[SPARK-30599][CORE][TESTS] Increase the maximum number of log events …
MaxGekk Jan 21, 2020
3c4e619
[SPARK-30553][DOCS] fix structured-streaming java example error
Jan 22, 2020
ab0890b
[SPARK-28264][PYTHON][SQL] Support type hints in pandas UDF and renam…
HyukjinKwon Jan 22, 2020
f2d71f5
[SPARK-30591][SQL] Remove the nonstandard SET OWNER syntax for namesp…
yaooqinn Jan 22, 2020
b8cb52a
[SPARK-30555][SQL] MERGE INTO insert action should only access column…
cloud-fan Jan 22, 2020
1c46bd9
[SPARK-30503][ML] OnlineLDAOptimizer does not handle persistance corr…
zhengruifeng Jan 22, 2020
8097b7e
[SPARK-30573][DOC] Document WHERE Clause of SELECT statement in SQL R…
dilipbiswal Jan 22, 2020
a6030ef
[SPARK-30575][DOC] Document HAVING Clause of SELECT statement in SQL …
dilipbiswal Jan 22, 2020
8f7f4d5
[SPARK-30583][DOC] Document LIMIT Clause of SELECT statement in SQL R…
dilipbiswal Jan 22, 2020
8e280ce
[SPARK-30592][SQL] Interval support for csv and json funtions
yaooqinn Jan 22, 2020
6dfaa07
[SPARK-30549][SQL] Fix the subquery shown issue in UI When enable AQE
Jan 22, 2020
84f1154
[SPARK-30604][CORE] Fix a log message by including hostLocalBlockByte…
Udbhav30 Jan 22, 2020
4ca31b4
[SPARK-30606][SQL] Fix the `like` function with 2 parameters
MaxGekk Jan 22, 2020
2e74dba
[SPARK-30574][DOC] Document GROUP BY Clause of SELECT statement in SQ…
dilipbiswal Jan 23, 2020
38f4e59
[SPARK-28801][DOC] Document SELECT statement in SQL Reference (Main p…
dilipbiswal Jan 23, 2020
eccae13
[SPARK-30531][WEB UI] Do not render plan viz when it exists already
EnricoMi Jan 23, 2020
bbab2bb
[SPARK-30556][SQL] Copy sparkContext.localproperties to child thread …
ajithme Jan 23, 2020
d2bca8f
[SPARK-30609] Allow default merge command resolution to be bypassed b…
tdas Jan 23, 2020
db528e4
[SPARK-30535][SQL] Revert "[] Migrate ALTER TABLE commands to the new…
brkyvz Jan 23, 2020
cd9ccdc
[SPARK-30601][BUILD] Add a Google Maven Central as a primary repository
HyukjinKwon Jan 23, 2020
2330a56
[SPARK-30607][SQL][PYSPARK][SPARKR] Add overlay wrappers for SparkR a…
zero323 Jan 23, 2020
f35f352
[SPARK-30543][ML][PYSPARK][R] RandomForest add Param bootstrap to con…
zhengruifeng Jan 23, 2020
d0bf447
[SPARK-30575][DOCS][FOLLOWUP] Fix typos in documents
huaxingao Jan 23, 2020
ffd435b
[SPARK-27871][SQL][FOLLOW-UP] Remove the conf spark.sql.optimizer.rea…
gatorsmile Jan 23, 2020
dbed4c7
[SPARK-30605][SQL] move defaultNamespace from SupportsNamespace to Ca…
cloud-fan Jan 23, 2020
705fc6a
[SPARK-30188][SQL][TESTS][FOLLOW-UP] Remove `sorted` in asserts of co…
MaxGekk Jan 23, 2020
3d7359a
[SPARK-29175][SQL][FOLLOW-UP] Rename the config name to spark.sql.mav…
xuanyuanking Jan 23, 2020
3c8b360
[SPARK-30620][SQL] avoid unnecessary serialization in AggregateExpres…
cloud-fan Jan 23, 2020
afe70b3
[SPARK-28794][SQL][DOC] Documentation for Create table Command
PavithraRamachandran Jan 23, 2020
843224e
[SPARK-30570][BUILD] Update scalafmt plugin to 1.0.3 with onlyChanged…
koeninger Jan 23, 2020
976946a
[SPARK-29947][SQL][FOLLOWUP] Fix table lookup cache
cloud-fan Jan 23, 2020
3228d72
[SPARK-30603][SQL] Move RESERVED_PROPERTIES from SupportsNamespaces a…
yaooqinn Jan 23, 2020
4847f73
[SPARK-30298][SQL] Respect aliases in output partitioning of projects…
imback82 Jan 23, 2020
3f76bd4
[SPARK-27083][SQL][FOLLOW-UP] Rename spark.sql.subquery.reuse to spar…
gatorsmile Jan 23, 2020
ddf8315
[SPARK-28962][SQL][FOLLOW-UP] Add the parameter description for the S…
gatorsmile Jan 24, 2020
53fd83a
[MINOR][DOCS] Fix src/dest type documentation for `to_timestamp`
deepyaman Jan 24, 2020
ed44926
[SPARK-30627][SQL] Disable all the V2 file sources by default
gengliangwang Jan 24, 2020
d1a673a
[SPARK-29924][DOCS] Document Apache Arrow JDK11 requirement
dongjoon-hyun Jan 24, 2020
f86a1b9
[SPARK-30626][K8S] Add SPARK_APPLICATION_ID into driver pod env
Jeffwan Jan 24, 2020
2f8e4d0
[SPARK-30630][ML] Remove numTrees in GBT in 3.0.0
huaxingao Jan 24, 2020
a0e63b6
[SPARK-29721][SQL] Prune unnecessary nested fields from Generate with…
viirya Jan 25, 2020
8629597
[SPARK-30639][BUILD] Upgrade Jersey to 2.30
dongjoon-hyun Jan 25, 2020
d5b92b2
[SPARK-30579][DOC] Document ORDER BY Clause of SELECT statement in SQ…
dilipbiswal Jan 26, 2020
40b1f4d
[SPARK-30645][SPARKR][TESTS][WINDOWS] Move Unicode test data to exter…
zero323 Jan 26, 2020
c3f17fa
[SPARK-29777][FOLLOW-UP][SPARKR] Remove no longer valid test for recu…
zero323 Jan 26, 2020
d69ed9a
Revert "[SPARK-25496][SQL] Deprecate from_utc_timestamp and to_utc_ti…
gatorsmile Jan 26, 2020
48f6478
[SPARK-30644][SQL][TEST] Remove query index from the golden files of …
gatorsmile Jan 26, 2020
d0800fc
[SPARK-30314] Add identifier and catalog information to DataSourceV2R…
yuchenhuo Jan 26, 2020
43d9c7e
[SPARK-30640][PYTHON][SQL] Prevent unnecessary copies of data during …
BryanCutler Jan 26, 2020
7e1b991
[SPARK-30581][DOC] Document SORT BY Clause of SELECT statement in SQL…
dilipbiswal Jan 27, 2020
5781e57
[SPARK-30589][DOC] Document DISTRIBUTE BY Clause of SELECT statement …
dilipbiswal Jan 27, 2020
8a24cf2
[SPARK-30588][DOC] Document CLUSTER BY Clause of SELECT statement in …
dilipbiswal Jan 27, 2020
0436b3d
[SPARK-30653][INFRA][SQL] EOL character enforcement for java/scala/xm…
HeartSaVioR Jan 27, 2020
c5c580b
[SPARK-30633][SQL] Append L to seed when type is LongType
patrickcording Jan 27, 2020
8aebc80
[SPARK-30625][SQL] Support `escape` as third parameter of the `like` …
MaxGekk Jan 27, 2020
96d2727
[SPARK-30642][ML][PYSPARK] LinearSVC blockify input vectors
zhengruifeng Jan 28, 2020
580c2b7
[SPARK-27166][SQL][FOLLOWUP] Refactor to build string once
dongjoon-hyun Jan 28, 2020
a2fe73b
[SPARK-30481][CORE] Integrate event log compactor into Spark History …
HeartSaVioR Jan 29, 2020
298d0a5
[SPARK-23435][SPARKR][TESTS] Update testthat to >= 2.0.0
zero323 Jan 29, 2020
ec1fb6b
[SPARK-30234][SQL][FOLLOWUP] Add `.enabled` in the suffix of the ADD …
maropu Jan 29, 2020
3e203c9
[SPARK-28801][DOC][FOLLOW-UP] Setup links and address other review co…
dilipbiswal Jan 29, 2020
d0f635e
[SPARK-30582][WEBUI] Spark UI is not showing Aggregated Metrics by Ex…
SaurabhChawla100 Jan 29, 2020
6b47ace
[SPARK-30512] Added a dedicated boss event loop group
otterc Jan 29, 2020
7173786
[SPARK-29543][SS][UI] Structured Streaming Web UI
uncleGen Jan 29, 2020
e5c7f89
[SPARK-30529][CORE] Improve error messages when Executor dies before …
tgravescs Jan 29, 2020
b1f81f0
[MINOR][SQL][DOCS] Fix typos in scaladoc strings of higher order func…
zero323 Jan 30, 2020
c228810
[SPARK-30672][BUILD] Add numpy to API docs readme
nchammas Jan 30, 2020
246c398
[SPARK-30435][DOC] Update doc of Supported Hive Features
AngersZhuuuu Jan 30, 2020
bda0669
[SPARK-30665][DOCS][BUILD][PYTHON] Eliminate pypandoc dependency
nchammas Jan 30, 2020
561e9b9
[SPARK-30674][INFRA] Use python3 in dev/lint-python
dongjoon-hyun Jan 30, 2020
073ce12
[SPARK-30659][ML][PYSPARK] LogisticRegression blockify input vectors
zhengruifeng Jan 30, 2020
a291433
[SPARK-30678][MLLIB][TESTS] Eliminate warnings from deprecated Bisect…
MaxGekk Jan 30, 2020
b0db623
[SPARK-29020][FOLLOWUP][SQL] Update description of array_sort function
kiszk Jan 30, 2020
7503e76
[SPARK-30622][SQL] commands should return dummy statistics
cloud-fan Jan 30, 2020
e5f572a
[SPARK-30680][SQL] ResolvedNamespace does not require a namespace cat…
cloud-fan Jan 30, 2020
f59685a
[SPARK-30662][ML][PYSPARK] ALS/MLP extend HasBlockSize
huaxingao Jan 30, 2020
5916c7d
[SPARK-30673][SQL][TESTS] Test cases in HiveShowCreateTableSuite shou…
viirya Jan 30, 2020
f56ba37
[SPARK-30656][SS] Support the "minPartitions" option in Kafka batch s…
zsxwing Jan 31, 2020
cbb714f
[SPARK-29438][SS] Use partition ID of StateStoreAwareZipPartitionsRDD…
HeartSaVioR Jan 31, 2020
ca3a64b
[SPARK-30481][CORE][FOLLOWUP] Execute log compaction only when merge …
HeartSaVioR Jan 31, 2020
9f42be2
[SPARK-29665][SQL] refine the TableProvider interface
cloud-fan Jan 31, 2020
5f3ec62
[SPARK-30362][CORE] Update InputMetrics in DataSourceRDD
sandeep-katta Jan 31, 2020
1cd19ad
[SPARK-30669][SS] Introduce AdmissionControl APIs for StructuredStrea…
brkyvz Jan 31, 2020
05be81d
[SPARK-30192][SQL][FOLLOWUP] Rename SINGLETON to INSTANCE
dongjoon-hyun Jan 31, 2020
a5c7090
[SPARK-30671][SQL] emptyDataFrame should use a LocalRelation
hvanhovell Jan 31, 2020
6fac411
[SPARK-29093][ML][PYSPARK][FOLLOW-UP] Remove duplicate setter
huaxingao Jan 31, 2020
290a528
[SPARK-30615][SQL] Introduce Analyzer rule for V2 AlterTable column c…
brkyvz Jan 31, 2020
6f4703e
[SPARK-30690][DOCS][BUILD] Add CalendarInterval into API documentation
HyukjinKwon Jan 31, 2020
3d2b8d8
[SPARK-30638][CORE] Add resources allocated to PluginContext
tgravescs Jan 31, 2020
21bc047
[SPARK-30511][SPARK-28403][CORE] Don't treat failed/killed speculativ…
Jan 31, 2020
5ccbb38
[SPARK-29938][SQL][FOLLOW-UP] Improve AlterTableAddPartitionCommand
Ngone51 Jan 31, 2020
481e521
[SPARK-30657][SPARK-30658][SS] Fixed two bugs in streaming limits
tdas Jan 31, 2020
ff0f636
[SPARK-30638][CORE][FOLLOWUP] Fix a spacing issue and use UTF-8 inste…
tgravescs Jan 31, 2020
5e0faf9
[SPARK-29779][SPARK-30479][CORE][SQL][FOLLOWUP] Reflect review commen…
HeartSaVioR Jan 31, 2020
5eac2dc
[SPARK-30691][SQL][DOC] Add a few main pages to SQL Reference
huaxingao Jan 31, 2020
33546d6
Revert "[SPARK-30036][SQL] Fix: REPARTITION hint does not work with o…
cloud-fan Jan 31, 2020
18bc4e5
[SPARK-30684][WEBUI] Show the descripton of metrics for WholeStageCod…
sarutak Jan 31, 2020
387ce89
[SPARK-27324][DOC][CORE] Document configurations related to executor …
wypoon Jan 31, 2020
2d4b5ea
[SPARK-30676][CORE][TESTS] Eliminate warnings from deprecated constru…
MaxGekk Jan 31, 2020
82b4f75
[SPARK-30508][SQL] Add SparkSession.executeCommand API for external d…
Ngone51 Jan 31, 2020
2fd15a2
[SPARK-30695][BUILD] Upgrade Apache ORC to 1.5.9
dongjoon-hyun Feb 1, 2020
d0c3e9f
[SPARK-30660][ML][PYSPARK] LinearRegression blockify input vectors
zhengruifeng Feb 1, 2020
8eecc20
[SPARK-27946][SQL] Hive DDL to Spark DDL conversion USING "show creat…
viirya Feb 1, 2020
878094f
[SPARK-30689][CORE][YARN] Add resource discovery plugin api to suppor…
tgravescs Feb 1, 2020
3538095
[SPARK-30698][BUILD] Bumps checkstyle from 8.25 to 8.29
Feb 1, 2020
534f5d4
[SPARK-29138][PYTHON][TEST] Increase timeout of StreamingLogisticRegr…
dongjoon-hyun Feb 1, 2020
da32d1e
[SPARK-30700][ML] NaiveBayesModel predict optimization
zhengruifeng Feb 1, 2020
cd5f03a
[SPARK-27686][DOC][SQL] Update migration guide for make Hive 2.3 depe…
wangyum Feb 2, 2020
1adf352
[SPARK-30704][INFRA] Use jekyll-redirect-from 0.15.0 instead of the l…
dongjoon-hyun Feb 2, 2020
fb321b6
[MINOR][SPARKR][DOCS] Remove duplicate @name tags from read.df and re…
zero323 Feb 3, 2020
2eccfd8
[SPARK-30697][SQL] Handle database and namespace exceptions in catalo…
brkyvz Feb 3, 2020
a4912ce
[SPARK-29543][SS][FOLLOWUP] Move `spark.sql.streaming.ui.*` configs t…
xuanyuanking Feb 3, 2020
47659a0
[SPARK-30525][SQL] HiveTableScanExec do not need to prune partitions …
fuwhu Feb 3, 2020
41bdb7a
[SPARK-30718][BUILD] Exclude jdk.tools dependency from hadoop-yarn-api
dongjoon-hyun Feb 4, 2020
6097b34
[SPARK-30717][SQL] AQE subquery map should cache `SubqueryExec` inste…
maryannxue Feb 4, 2020
0202b67
[SPARK-26618][SQL][FOLLOWUP] Describe the behavior change of typed `T…
MaxGekk Feb 4, 2020
f2dd082
[SPARK-30725][SQL] Make legacy SQL configs as internal configs
MaxGekk Feb 4, 2020
7631275
[SPARK-25040][SQL][FOLLOWUP] Add legacy config for allowing empty str…
viirya Feb 5, 2020
8987169
Revert "[SPARK-28310][SQL] Support (FIRST_VALUE|LAST_VALUE)(expr[ (IG…
dongjoon-hyun Feb 5, 2020
692e3dd
[SPARK-27870][PYTHON][FOLLOW-UP] Rename spark.sql.pandas.udf.buffer.s…
HyukjinKwon Feb 5, 2020
e2d984a
[SPARK-30733][R][HOTFIX] Fix SparkR tests per testthat and R version …
HyukjinKwon Feb 5, 2020
30e418a
[SPARK-30594][CORE] Do not post SparkListenerBlockUpdated when update…
Ngone51 Feb 5, 2020
5983ad9
[SPARK-30506][SQL][DOC] Document for generic file source options/configs
Ngone51 Feb 5, 2020
86fdb81
[SPARK-30715][K8S] Bump fabric8 to 4.7.1
onursatici Feb 5, 2020
459e757
[SPARK-30668][SQL] Support `SimpleDateFormat` patterns in parsing tim…
MaxGekk Feb 5, 2020
6d507b4
[SPARK-26218][SQL][FOLLOW UP] Fix the corner case when casting float …
turboFei Feb 5, 2020
ec70e07
[MINOR][DOC] Add migration note for removing `org.apache.spark.ml.ima…
WeichenXu123 Feb 5, 2020
9d90c8b
[SPARK-30738][K8S] Use specific image version in "Launcher client dep…
dongjoon-hyun Feb 5, 2020
4938905
[SPARK-29864][SQL][FOLLOWUP] Reference the config for the old behavio…
xuanyuanking Feb 5, 2020
3b26f80
[SPARK-30721][SQL][TESTS] Fix DataFrameAggregateSuite when enabling AQE
cloud-fan Feb 5, 2020
f9f06ee
[SPARK-30122][K8S] Support spark.kubernetes.authenticate.executor.ser…
Feb 5, 2020
9d907bc
[SPARK-30743][K8S][TESTS] Use JRE instead of JDK in K8S test docker i…
dongjoon-hyun Feb 6, 2020
20c60a4
[MINOR][INFRA] Factor Python executable out as a variable in 'lint-py…
HyukjinKwon Feb 6, 2020
b95ccb1
[SPARK-30737][SPARK-27262][R][BUILD] Reenable CRAN check with UTF-8 e…
HyukjinKwon Feb 6, 2020
aebabf0
[SPARK-30729][CORE] Eagerly filter out zombie TaskSetManager before o…
Ngone51 Feb 6, 2020
c27a616
[SPARK-30612][SQL] Resolve qualified column name with v2 tables
imback82 Feb 6, 2020
c8ef1de
[SPARK-29108][SQL][TESTS][FOLLOWUP] Comment out no use test case and …
beliefer Feb 6, 2020
3f5b233
[SPARK-30744][SQL] Optimize AnalyzePartitionCommand by calculating lo…
Ngone51 Feb 6, 2020
368ee62
[SPARK-27297][DOC][FOLLOW-UP] Improve documentation for various Scala…
Ngone51 Feb 6, 2020
d861357
[SPARK-26700][CORE][FOLLOWUP] Add config `spark.network.maxRemoteBloc…
xuanyuanking Feb 6, 2020
8ce5862
[SPARK-30719][SQL] do not log warning if AQE is intentionally skipped…
cloud-fan Feb 6, 2020
5a4c70b
[SPARK-27986][SQL][FOLLOWUP] window aggregate function with filter pr…
cloud-fan Feb 6, 2020
4804445
[MINOR][DOC] Fix document UI left menu broken
xuanyuanking Feb 6, 2020
dd2f443
[MINOR][DOCS] Fix typos at python/pyspark/sql/types.py
sharifahmad2061 Feb 7, 2020
a3e7777
[SPARK-30752][SQL] Fix `to_utc_timestamp` on daylight saving day
MaxGekk Feb 7, 2020
a7451f4
[SPARK-30614][SQL] The native ALTER COLUMN syntax should change one p…
imback82 Feb 7, 2020
12e1bba
Revert "[SPARK-30642][SPARK-30659][SPARK-30660][SPARK-30662]"
zhengruifeng Feb 8, 2020
3db3e39
[SPARK-28228][SQL] Change the default behavior for name conflict in n…
xuanyuanking Feb 8, 2020
e1cd4d9
[SPARK-29587][DOC][FOLLOWUP] Add `SQL` tab in the `Data Types` page
xuanyuanking Feb 8, 2020
a7ae77a
[SPARK-30662][ML][PYSPARK] Put back the API changes for HasBlockSize …
huaxingao Feb 9, 2020
339c0f9
[SPARK-30510][SQL][DOCS] Publicly document Spark SQL configuration op…
nchammas Feb 9, 2020
b877aac
[SPARK-30684 ][WEBUI][FollowUp] A new approach for SPARK-30684
gengliangwang Feb 9, 2020
9f8172e
Revert "[SPARK-29721][SQL] Prune unnecessary nested fields from Gener…
viirya Feb 10, 2020
58b9ca1
[SPARK-30592][SQL][FOLLOWUP] Add some round-trip test cases
yaooqinn Feb 10, 2020
70e545a
[SPARK-30757][SQL][DOC] Update the doc on TableCatalog.alterTable's b…
imback82 Feb 10, 2020
5a24060
[SPARK-30719][SQL] Add unit test to verify the log warning print when…
Feb 10, 2020
b2011a2
[SPARK-30326][SQL] Raise exception if analyzer exceed max iterations
Eric5553 Feb 10, 2020
acfdb46
[SPARK-27946][SQL][FOLLOW-UP] Change doc and error message for SHOW C…
viirya Feb 10, 2020
4439b29
Revert "[SPARK-30245][SQL] Add cache for Like and RLike when pattern …
HyukjinKwon Feb 10, 2020
3c1c9b4
[SPARK-30759][SQL] Initialize cache for foldable patterns in StringRe…
MaxGekk Feb 10, 2020
a6b91d2
[SPARK-30556][SQL][FOLLOWUP] Reset the status changed in SQLExecution…
xuanyuanking Feb 10, 2020
e2ebca7
[SPARK-30779][SS] Fix some API issues found when reviewing Structured…
zsxwing Feb 10, 2020
07a9885
[SPARK-30777][PYTHON][TESTS] Fix test failures for Pandas >= 1.0.0
BryanCutler Feb 11, 2020
2bc765a
[SPARK-30756][SQL] Fix `ThriftServerWithSparkContextSuite` on spark-b…
HyukjinKwon Feb 11, 2020
0045be7
[SPARK-29462][SQL] The data type of "array()" should be array<null>
HyukjinKwon Feb 11, 2020
b20754d
[SPARK-27545][SQL][DOC] Update the Documentation for CACHE TABLE and …
iRakson Feb 11, 2020
f1d0dce
[MINOR][DOC] Add class document for PruneFileSourcePartitions and Pru…
fuwhu Feb 11, 2020
dc66d57
[SPARK-30754][SQL] Reuse results of floorDiv in calculations of floor…
MaxGekk Feb 11, 2020
ea626b6
[SPARK-30783] Exclude hive-service-rpc
yhuai Feb 11, 2020
99bd59f
[SPARK-29462][SQL][DOCS] Add some more context and details in 'spark.…
HyukjinKwon Feb 11, 2020
45db48e
Revert "[SPARK-30625][SQL] Support `escape` as third parameter of the…
MaxGekk Feb 11, 2020
b25359c
[SPARK-30780][SQL] Empty LocalTableScan should use RDD without partit…
hvanhovell Feb 12, 2020
aa6a605
[SPARK-30722][PYTHON][DOCS] Update documentation for Pandas UDF with …
HyukjinKwon Feb 12, 2020
b476999
[SPARK-30795][SQL] Spark SQL codegen's code() interpolator should tre…
rednaxelafx Feb 12, 2020
f5026b1
[SPARK-30763][SQL] Fix java.lang.IndexOutOfBoundsException No group 1…
beliefer Feb 12, 2020
8b18397
[SPARK-29542][FOLLOW-UP] Keep the description of spark.sql.files.* in…
turboFei Feb 12, 2020
c198620
[SPARK-30788][SQL] Support `SimpleDateFormat` and `FastDateFormat` as…
MaxGekk Feb 12, 2020
61b1e60
[SPARK-30759][SQL][TESTS][FOLLOWUP] Check cache initialization in Str…
MaxGekk Feb 12, 2020
5919bd3
[SPARK-30651][SQL] Add detailed information for Aggregate operators i…
Eric5553 Feb 12, 2020
aa0d136
[SPARK-30760][SQL] Port `millisToDays` and `daysToMillis` on Java 8 t…
MaxGekk Feb 12, 2020
5b76367
[SPARK-30797][SQL] Set tradition user/group/other permission to ACL e…
viirya Feb 12, 2020
496f6ac
[SPARK-29148][CORE] Add stage level scheduling dynamic allocation and…
tgravescs Feb 12, 2020
926e3a1
[SPARK-30790] The dataType of map() should be map<null,null>
iRakson Feb 13, 2020
453d526
[SPARK-30528][SQL] Turn off DPP subquery duplication by default
maryannxue Feb 13, 2020
a6b4b91
[SPARK-30613][SQL] Support Hive style REPLACE COLUMNS syntax
imback82 Feb 13, 2020
04604b9
[SPARK-30758][SQL][TESTS] Improve bracketed comments tests
beliefer Feb 13, 2020
fb0e07b
[SPARK-29231][SQL] Constraints should be inferred from cast equality …
wangyum Feb 13, 2020
82d0aa3
[SPARK-30762] Add dtype=float32 support to vector_to_array UDF
Feb 13, 2020
3c4044e
[SPARK-30703][SQL][DOCS] Add a document for the ANSI mode
maropu Feb 13, 2020
a4ceea6
[SPARK-30751][SQL] Combine the skewed readers into one in AQE skew jo…
cloud-fan Feb 13, 2020
8596991
[SPARK-30807][K8S][TESTS] Support Java 11 in K8S integration tests
dongjoon-hyun Feb 13, 2020
74cd46e
[SPARK-30816][K8S][TESTS] Fix dev-run-integration-tests.sh to ignore …
dongjoon-hyun Feb 13, 2020
e2d3983
[SPARK-30798][SQL] Scope Session.active in QueryExecution
dbaliafroozeh Feb 13, 2020
57254c9
[SPARK-30667][CORE] Add allGather method to BarrierTaskContext
sarthfrey Feb 14, 2020
fa3517c
Revert "[SPARK-30667][CORE] Add allGather method to BarrierTaskContext"
jiangxb1987 Feb 14, 2020
25db8c7
[PYSPARK][DOCS][MINOR] Changed `:func:` to `:attr:` Sphinx roles, fix…
DavidToneian Feb 14, 2020
0aed77a
[SPARK-30801][SQL] Subqueries should not be AQE-ed if main query is not
maryannxue Feb 14, 2020
b2134ee
[SPARK-30823][PYTHON][DOCS] Set `%PYTHONPATH%` when building PySpark …
DavidToneian Feb 14, 2020
99b8136
[SPARK-25990][SQL] ScriptTransformation should handle different data …
Ngone51 Feb 14, 2020
2a270a7
[SPARK-30810][SQL] Parses and convert a CSV Dataset having different …
HyukjinKwon Feb 14, 2020
7137a6d
[SPARK-30766][SQL] Fix the timestamp truncation to the `HOUR` and `DA…
MaxGekk Feb 14, 2020
b343757
[SPARK-29748][DOCS][FOLLOW-UP] Add a note that the legacy environment…
HyukjinKwon Feb 14, 2020
d273a2b
[SPARK-20628][CORE][K8S] Start to improve Spark decommissioning & pre…
holdenk Feb 14, 2020
d0f9614
[SPARK-30289][SQL] Partitioned by Nested Column for `InMemoryTable`
dbtsai Feb 14, 2020
8b73b92
[SPARK-30826][SQL] Respect reference case in `StringStartsWith` pushe…
MaxGekk Feb 15, 2020
f5238ea
[GRAPHX][MINOR] Fix typo setRest => setDest
xwu-intel Feb 16, 2020
0a03e7e
[SPARK-30691][SQL][DOC][FOLLOW-UP] Make link names exactly the same a…
huaxingao Feb 16, 2020
01cc852
[SPARK-30803][DOCS] Fix the home page link for Scala API document
xuanyuanking Feb 16, 2020
8ebbf85
[SPARK-30772][ML][SQL] avoid tuple assignment because it will circumv…
zhengruifeng Feb 16, 2020
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
The diff you're trying to view is too large. We only load the first 3000 changed files.
5 changes: 5 additions & 0 deletions .gitattributes
Original file line number Diff line number Diff line change
@@ -1,2 +1,7 @@
*.bat text eol=crlf
*.cmd text eol=crlf
*.java text eol=lf
*.scala text eol=lf
*.xml text eol=lf
*.py text eol=lf
*.R text eol=lf
100 changes: 96 additions & 4 deletions .github/workflows/master.yml
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,9 @@ on:
push:
branches:
- master
pull_request:
branches:
- master

jobs:
build:
Expand All @@ -12,16 +15,105 @@ jobs:
strategy:
matrix:
java: [ '1.8', '11' ]
name: Build Spark with JDK ${{ matrix.java }}
hadoop: [ 'hadoop-2.7', 'hadoop-3.2' ]
hive: [ 'hive-1.2', 'hive-2.3' ]
exclude:
- java: '11'
hive: 'hive-1.2'
- hadoop: 'hadoop-3.2'
hive: 'hive-1.2'
name: Build Spark - JDK${{ matrix.java }}/${{ matrix.hadoop }}/${{ matrix.hive }}

steps:
- uses: actions/checkout@master
# We split caches because GitHub Action Cache has a 400MB-size limit.
- uses: actions/cache@v1
with:
path: build
key: build-${{ hashFiles('**/pom.xml') }}
restore-keys: |
build-
- uses: actions/cache@v1
with:
path: ~/.m2/repository/com
key: ${{ matrix.java }}-${{ matrix.hadoop }}-maven-com-${{ hashFiles('**/pom.xml') }}
restore-keys: |
${{ matrix.java }}-${{ matrix.hadoop }}-maven-com-
- uses: actions/cache@v1
with:
path: ~/.m2/repository/org
key: ${{ matrix.java }}-${{ matrix.hadoop }}-maven-org-${{ hashFiles('**/pom.xml') }}
restore-keys: |
${{ matrix.java }}-${{ matrix.hadoop }}-maven-org-
- uses: actions/cache@v1
with:
path: ~/.m2/repository/net
key: ${{ matrix.java }}-${{ matrix.hadoop }}-maven-net-${{ hashFiles('**/pom.xml') }}
restore-keys: |
${{ matrix.java }}-${{ matrix.hadoop }}-maven-net-
- uses: actions/cache@v1
with:
path: ~/.m2/repository/io
key: ${{ matrix.java }}-${{ matrix.hadoop }}-maven-io-${{ hashFiles('**/pom.xml') }}
restore-keys: |
${{ matrix.java }}-${{ matrix.hadoop }}-maven-io-
- name: Set up JDK ${{ matrix.java }}
uses: actions/setup-java@v1
with:
version: ${{ matrix.java }}
java-version: ${{ matrix.java }}
- name: Build with Maven
run: |
export MAVEN_OPTS="-Xmx2g -XX:ReservedCodeCacheSize=512m -Dorg.slf4j.simpleLogger.defaultLogLevel=WARN"
export MAVEN_OPTS="-Xmx2g -XX:ReservedCodeCacheSize=1g -Dorg.slf4j.simpleLogger.defaultLogLevel=WARN"
export MAVEN_CLI_OPTS="--no-transfer-progress"
./build/mvn $MAVEN_CLI_OPTS -DskipTests -Pyarn -Pmesos -Pkubernetes -Phive -Phive-thriftserver -Phadoop-3.2 -Phadoop-cloud -Djava.version=${{ matrix.java }} package
mkdir -p ~/.m2
./build/mvn $MAVEN_CLI_OPTS -DskipTests -Pyarn -Pmesos -Pkubernetes -Phive -P${{ matrix.hive }} -Phive-thriftserver -P${{ matrix.hadoop }} -Phadoop-cloud -Djava.version=${{ matrix.java }} install
rm -rf ~/.m2/repository/org/apache/spark


lint:
runs-on: ubuntu-latest
name: Linters (Java/Scala/Python), licenses, dependencies
steps:
- uses: actions/checkout@master
- uses: actions/setup-java@v1
with:
java-version: '11'
- uses: actions/setup-python@v1
with:
python-version: '3.x'
architecture: 'x64'
- name: Scala
run: ./dev/lint-scala
- name: Java
run: ./dev/lint-java
- name: Python
run: |
pip install flake8 sphinx numpy
./dev/lint-python
- name: License
run: ./dev/check-license
- name: Dependencies
run: ./dev/test-dependencies.sh

lintr:
runs-on: ubuntu-latest
name: Linter (R)
steps:
- uses: actions/checkout@master
- uses: actions/setup-java@v1
with:
java-version: '11'
- name: install R
run: |
echo 'deb https://cloud.r-project.org/bin/linux/ubuntu bionic-cran35/' | sudo tee -a /etc/apt/sources.list
curl -sL "https://keyserver.ubuntu.com/pks/lookup?op=get&search=0xE298A3A825C0D65DFD57CBB651716619E084DAB9" | sudo apt-key add
sudo apt-get update
sudo apt-get install -y r-base r-base-dev libcurl4-openssl-dev
- name: install R packages
run: |
sudo Rscript -e "install.packages(c('curl', 'xml2', 'httr', 'devtools', 'testthat', 'knitr', 'rmarkdown', 'roxygen2', 'e1071', 'survival'), repos='https://cloud.r-project.org/')"
sudo Rscript -e "devtools::install_github('jimhester/lintr@v2.0.0')"
- name: package and install SparkR
run: ./R/install-dev.sh
- name: lint-r
run: ./dev/lint-r
24 changes: 24 additions & 0 deletions .github/workflows/stale.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,24 @@
name: Close stale PRs

on:
schedule:
- cron: "0 0 * * *"

jobs:
stale:
runs-on: ubuntu-latest
steps:
- uses: actions/stale@v1.1.0
with:
repo-token: ${{ secrets.GITHUB_TOKEN }}
stale-pr-message: >
We're closing this PR because it hasn't been updated in a while.
This isn't a judgement on the merit of the PR in any way. It's just
a way of keeping the PR queue manageable.

If you'd like to revive this PR, please reopen it and ask a
committer to remove the Stale tag!
days-before-stale: 100
# Setting this to 0 is the same as setting it to 1.
# See: https://github.com/actions/stale/issues/28
days-before-close: 0
3 changes: 2 additions & 1 deletion .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -45,7 +45,7 @@ dev/create-release/*final
dev/create-release/*txt
dev/pr-deps/
dist/
docs/_site
docs/_site/
docs/api
sql/docs
sql/site
Expand All @@ -63,6 +63,7 @@ project/plugins/target/
python/lib/pyspark.zip
python/.eggs/
python/deps
python/docs/_site/
python/test_coverage/coverage_data
python/test_coverage/htmlcov
python/pyspark/python
Expand Down
3 changes: 2 additions & 1 deletion LICENSE
Original file line number Diff line number Diff line change
Expand Up @@ -216,6 +216,7 @@ core/src/main/resources/org/apache/spark/ui/static/bootstrap*
core/src/main/resources/org/apache/spark/ui/static/jsonFormatter*
core/src/main/resources/org/apache/spark/ui/static/vis*
docs/js/vendor/bootstrap.js
external/spark-ganglia-lgpl/src/main/java/com/codahale/metrics/ganglia/GangliaReporter.java


Python Software Foundation License
Expand Down Expand Up @@ -243,7 +244,7 @@ MIT License
core/src/main/resources/org/apache/spark/ui/static/dagre-d3.min.js
core/src/main/resources/org/apache/spark/ui/static/*dataTables*
core/src/main/resources/org/apache/spark/ui/static/graphlib-dot.min.js
ore/src/main/resources/org/apache/spark/ui/static/jquery*
core/src/main/resources/org/apache/spark/ui/static/jquery*
core/src/main/resources/org/apache/spark/ui/static/sorttable.js
docs/js/vendor/anchor.min.js
docs/js/vendor/jquery*
Expand Down
Loading