Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
3510 commits
Select commit Hold shift + click to select a range
3cdae0f
[SPARK-17638][STREAMING] Stop JVM StreamingContext when the Python pr…
zsxwing Sep 22, 2016
0d63487
[SPARK-17616][SQL] Support a single distinct aggregate combined with …
hvanhovell Sep 22, 2016
f4f6bd8
[SPARK-16240][ML] ML persistence backward compatibility for LDA
GayathriMurali Sep 22, 2016
a166196
[SPARK-17569][SPARK-17569][TEST] Make the unit test added for work again
brkyvz Sep 22, 2016
79159a1
[SPARK-17635][SQL] Remove hardcode "agg_plan" in HashAggregateExec
Sep 23, 2016
a4aeb76
[SPARK-17639][BUILD] Add jce.jar to buildclasspath when building.
Sep 23, 2016
947b8c6
[SPARK-16719][ML] Random Forests should communicate fewer trees on ea…
jkbradley Sep 23, 2016
62ccf27
[SPARK-17640][SQL] Avoid using -1 as the default batchId for FileStre…
zsxwing Sep 23, 2016
5c5396c
[BUILD] Closes some stale PRs
HyukjinKwon Sep 23, 2016
90d5754
[SPARK-16861][PYSPARK][CORE] Refactor PySpark accumulator API on top …
holdenk Sep 23, 2016
f89808b
[SPARK-17499][SPARKR][ML][MLLIB] make the default params in sparkR sp…
WeichenXu123 Sep 23, 2016
f62ddc5
[SPARK-17210][SPARKR] sparkr.zip is not distributed to executors when…
zjffdu Sep 23, 2016
988c714
[SPARK-17643] Remove comparable requirement from Offset
marmbrus Sep 23, 2016
90a30f4
[SPARK-12221] add cpu time to metrics
jisookim0513 Sep 23, 2016
7c38252
[SPARK-17651][SPARKR] Set R package version number along with mvn
shivaram Sep 23, 2016
f3fe554
[SPARK-10835][ML] Word2Vec should accept non-null string array, in ad…
srowen Sep 24, 2016
248916f
[SPARK-17057][ML] ProbabilisticClassifierModels' thresholds should ha…
srowen Sep 24, 2016
7945dae
[MINOR][SPARKR] Add sparkr-vignettes.html to gitignore.
yanboliang Sep 24, 2016
de333d1
[SPARK-17551][SQL] Add DataFrame API for null ordering
xwu0226 Sep 25, 2016
59d87d2
[SPARK-17650] malformed url's throw exceptions before bricking Executors
brkyvz Sep 26, 2016
ac65139
[SPARK-17017][FOLLOW-UP][ML] Refactor of ChiSqSelector and add ML Pyt…
yanboliang Sep 26, 2016
50b89d0
[SPARK-14525][SQL] Make DataFrameWrite.save work for jdbc
JustinPihony Sep 26, 2016
f234b7c
[SPARK-16356][ML] Add testImplicits for ML unit tests and promote toDF()
HyukjinKwon Sep 26, 2016
bde85f8
[SPARK-17649][CORE] Log how many Spark events got dropped in LiveList…
zsxwing Sep 26, 2016
8135e0e
[SPARK-17153][SQL] Should read partition data when reading new files …
viirya Sep 26, 2016
7c7586a
[SPARK-17652] Fix confusing exception message while reserving capacity
sameeragarwal Sep 26, 2016
00be16d
[Docs] Update spark-standalone.md to fix link
ammills01 Sep 26, 2016
93c743f
[SPARK-17577][FOLLOW-UP][SPARKR] SparkR spark.addFile supports adding…
yanboliang Sep 26, 2016
6ee2842
Fix two comments since Actor is not used anymore.
Sep 27, 2016
85b0a15
[SPARK-15962][SQL] Introduce implementation with a dense format for U…
kiszk Sep 27, 2016
7f16aff
[SPARK-17138][ML][MLIB] Add Python API for multinomial logistic regre…
WeichenXu123 Sep 27, 2016
6a68c5d
[SPARK-16757] Set up Spark caller context to HDFS and YARN
weiqingy Sep 27, 2016
5de1737
[SPARK-16777][SQL] Do not use deprecated listType API in ParquetSchem…
HyukjinKwon Sep 27, 2016
2cac3b2
[SPARK-16516][SQL] Support for pushing down filters for decimal and t…
HyukjinKwon Sep 27, 2016
120723f
[SPARK-17682][SQL] Mark children as final for unary, binary, leaf exp…
rxin Sep 27, 2016
2ab24a7
[SPARK-17660][SQL] DESC FORMATTED for VIEW Lacks View Definition
gatorsmile Sep 27, 2016
67c7305
[SPARK-17677][SQL] Break WindowExec.scala into multiple files
rxin Sep 27, 2016
2f84a68
[SPARK-17618] Guard against invalid comparisons between UnsafeRow and…
JoshRosen Sep 27, 2016
e7bce9e
[SPARK-17056][CORE] Fix a wrong assert regarding unroll memory in Mem…
viirya Sep 27, 2016
b03b4ad
[SPARK-17666] Ensure that RecordReaders are closed by data source fil…
JoshRosen Sep 28, 2016
4a83395
[SPARK-17499][SPARKR][FOLLOWUP] Check null first for layers in spark.…
HyukjinKwon Sep 28, 2016
b2a7eed
[SPARK-17017][ML][MLLIB][ML][DOC] Updated the ml/mllib feature select…
lins05 Sep 28, 2016
2190037
[MINOR][PYSPARK][DOCS] Fix examples in PySpark documentation
HyukjinKwon Sep 28, 2016
46d1203
[SPARK-17644][CORE] Do not add failedStages when abortStage for fetch…
scwf Sep 28, 2016
a6cfa3f
[SPARK-17673][SQL] Incorrect exchange reuse with RowDataSourceScan
ericl Sep 28, 2016
557d6e3
[SPARK-17713][SQL] Move row-datasource related tests out of JDBCSuite
ericl Sep 28, 2016
7d09232
[SPARK-17641][SQL] Collect_list/Collect_set should not collect null v…
hvanhovell Sep 28, 2016
7dfad4b
[SPARK-17710][HOTFIX] Fix ClassCircularityError in ReplSuite tests in…
weiqingy Sep 29, 2016
37eb918
[SPARK-17712][SQL] Fix invalid pushdown of data-independent filters b…
JoshRosen Sep 29, 2016
a19a1bb
[SPARK-16356][FOLLOW-UP][ML] Enforce ML test of exception for local/d…
yanboliang Sep 29, 2016
f7082ac
[SPARK-17704][ML][MLLIB] ChiSqSelector performance improvement.
yanboliang Sep 29, 2016
b35b0db
[SPARK-17614][SQL] sparkSession.read() .jdbc(***) use the sql syntax …
srowen Sep 29, 2016
b2e9731
[MINOR][DOCS] Fix th doc. of spark-streaming with kinesis
maropu Sep 29, 2016
9582004
[DOCS] Reorganize explanation of Accumulators and Broadcast Variables
Sep 29, 2016
7f779e7
[SPARK-17648][CORE] TaskScheduler really needs offers to be an Indexe…
squito Sep 29, 2016
cb87b3c
[SPARK-17672] Spark 2.0 history server web Ui takes too long for a si…
wgtmac Sep 29, 2016
027dea8
[SPARK-17715][SCHEDULER] Make task launch logs DEBUG
bchocho Sep 29, 2016
fe33121
[SPARK-17699] Support for parsing JSON string columns
marmbrus Sep 29, 2016
566d7f2
[SPARK-17653][SQL] Remove unnecessary distincts in multiple unions
viirya Sep 29, 2016
4ecc648
[SPARK-17612][SQL] Support `DESCRIBE table PARTITION` SQL syntax
dongjoon-hyun Sep 29, 2016
29396e7
[SPARK-17721][MLLIB][ML] Fix for multiplying transposed SparseMatrix …
bwahlgreen Sep 29, 2016
3993ebc
[SPARK-17676][CORE] FsHistoryProvider should ignore hidden files
squito Sep 29, 2016
39eb3bb
[SPARK-17412][DOC] All test should not be run by `root` or any admin …
dongjoon-hyun Sep 29, 2016
2f73956
[SPARK-17697][ML] Fixed bug in summary calculations that pattern matc…
BryanCutler Sep 29, 2016
74ac1c4
[SPARK-17717][SQL] Add exist/find methods to Catalog.
hvanhovell Sep 30, 2016
1fad559
[SPARK-14077][ML] Refactor NaiveBayes to support weighted instances
zhengruifeng Sep 30, 2016
8e491af
[SPARK-14077][ML][FOLLOW-UP] Revert change for NB Model's Load to mai…
zhengruifeng Sep 30, 2016
f327e16
[SPARK-17738] [SQL] fix ARRAY/MAP in columnar cache
Sep 30, 2016
81455a9
[SPARK-17703][SQL] Add unnamed version of addReferenceObj for minor o…
ueshin Oct 1, 2016
a26afd5
[SPARK-15353][CORE] Making peer selection for block replication plugg…
shubhamchopra Oct 1, 2016
aef506e
[SPARK-17739][SQL] Collapse adjacent similar Window operators
dongjoon-hyun Oct 1, 2016
15e9bbb
[MINOR][DOC] Add an up-to-date description for default serialization …
dongjoon-hyun Oct 1, 2016
4bcd9b7
[SPARK-17740] Spark tests should mock / interpose HDFS to ensure that…
ericl Oct 1, 2016
af6ece3
[SPARK-17717][SQL] Add Exist/find methods to Catalog [FOLLOW-UP]
hvanhovell Oct 1, 2016
b88cb63
[SPARK-17704][ML][MLLIB] ChiSqSelector performance improvement.
srowen Oct 1, 2016
f8d7fad
[SPARK-17509][SQL] When wrapping catalyst datatype to Hive data type …
Oct 2, 2016
76dc2d9
[SPARK-14914][CORE][SQL] Skip/fix some test cases on Windows due to l…
taoli91 Oct 2, 2016
de3f71e
[SPARK-17598][SQL][WEB UI] User-friendly name for Spark Thrift Server…
ajbozarth Oct 3, 2016
a27033c
[SPARK-17736][DOCUMENTATION][SPARKR] Update R README for rmarkdown,…
jagadeesanas2 Oct 3, 2016
7bf9212
[SPARK-17073][SQL] generate column-level statistics
wzhfy Oct 3, 2016
1dd68d3
[SPARK-17718][DOCS][MLLIB] Make loss function formulation label note …
srowen Oct 3, 2016
1f31bda
[SPARK-17679] [PYSPARK] remove unnecessary Py4J ListConverter patch
Oct 3, 2016
d8399b6
[SPARK-17587][PYTHON][MLLIB] SparseVector __getitem__ should follow _…
zero323 Oct 4, 2016
2bbecde
[SPARK-17753][SQL] Allow a complex expression as the input a value ba…
hvanhovell Oct 4, 2016
c571cfb
[SPARK-17112][SQL] "select null" via JDBC triggers IllegalArgumentExc…
dongjoon-hyun Oct 4, 2016
b1b4727
[SPARK-17702][SQL] Code generation including too many mutable states …
ueshin Oct 4, 2016
d2dc8c4
[SPARK-17773] Input/Output] Add VoidObjectInspector
seyfe Oct 4, 2016
126baa8
[SPARK-17559][MLLIB] persist edges if their storage level is non in P…
Oct 4, 2016
8e8de00
[SPARK-17671][WEBUI] Spark 2.0 history server summary page is slow ev…
srowen Oct 4, 2016
7d51608
[SPARK-16962][CORE][SQL] Fix misaligned record accesses for SPARC arc…
sumansomasundar Oct 4, 2016
c17f971
[SPARK-17744][ML] Parity check between the ml and mllib test suites f…
zhengruifeng Oct 4, 2016
068c198
[SPARKR][DOC] minor formatting and output cleanup for R vignettes
felixcheung Oct 4, 2016
8d969a2
[SPARK-17549][SQL] Only collect table size stat in driver for cached …
Oct 4, 2016
a99743d
[SPARK-17495][SQL] Add Hash capability semantically equivalent to Hive's
tejasapatil Oct 5, 2016
c9fe10d
[SPARK-17658][SPARKR] read.df/write.df API taking path optionally in …
HyukjinKwon Oct 5, 2016
89516c1
[SPARK-17258][SQL] Parse scientific decimal literals as decimals
hvanhovell Oct 5, 2016
6a05eb2
[SPARK-17328][SQL] Fix NPE with EXPLAIN DESCRIBE TABLE
dongjoon-hyun Oct 5, 2016
9df54f5
[SPARK-17239][ML][DOC] Update user guide for multiclass logistic regr…
sethah Oct 5, 2016
221b418
[SPARK-17778][TESTS] Mock SparkContext to reduce memory usage of Bloc…
zsxwing Oct 5, 2016
5fd54b9
[SPARK-17758][SQL] Last returns wrong result in case of empty partition
hvanhovell Oct 5, 2016
9293734
[SPARK-17346][SQL] Add Kafka source for Structured Streaming
zsxwing Oct 5, 2016
b678e46
[SPARK-17346][SQL][TEST-MAVEN] Generate the sql test jar to fix the m…
zsxwing Oct 6, 2016
7aeb20b
[MINOR][ML] Avoid 2D array flatten in NB training.
yanboliang Oct 6, 2016
5e9f32d
[BUILD] Closing some stale PRs
HyukjinKwon Oct 6, 2016
92b7e57
[SPARK-17750][SQL] Fix CREATE VIEW with INTERVAL arithmetic.
dongjoon-hyun Oct 6, 2016
79accf4
[SPARK-17798][SQL] Remove redundant Experimental annotations in sql.s…
rxin Oct 6, 2016
9a48e60
[SPARK-17780][SQL] Report Throwable to user in StreamExecution
zsxwing Oct 6, 2016
49d11d4
[SPARK-17803][TESTS] Upgrade docker-client dependency
ckadner Oct 6, 2016
3713bb1
[SPARK-17792][ML] L-BFGS solver for linear regression does not accept…
sethah Oct 7, 2016
bcaa799
[SPARK-17805][PYSPARK] Fix in sqlContext.read.text when pass in list …
BryanCutler Oct 7, 2016
18bf9d2
[SPARK-17782][STREAMING][BUILD] Add Kafka 0.10 project to build modules
hvanhovell Oct 7, 2016
24097d8
[SPARK-17795][WEB UI] Sorting on stage or job tables doesn’t reload p…
ajbozarth Oct 7, 2016
2b01d3c
[SPARK-16960][SQL] Deprecate approxCountDistinct, toDegrees and toRad…
HyukjinKwon Oct 7, 2016
e56614c
[SPARK-16827] Stop reporting spill metrics as shuffle metrics
bchocho Oct 7, 2016
dd16b52
[SPARK-17800] Introduce InterfaceStability annotation
rxin Oct 7, 2016
cff5607
[SPARK-17707][WEBUI] Web UI prevents spark-submit application to be f…
srowen Oct 7, 2016
aa3a684
[SPARK-14525][SQL][FOLLOWUP] Clean up JdbcRelationProvider
HyukjinKwon Oct 7, 2016
bb1aaf2
[SPARK-16411][SQL][STREAMING] Add textFile to Structured Streaming.
ScrapCodes Oct 7, 2016
9d8ae85
[SPARK-17665][SPARKR] Support options/mode all for read/write APIs an…
HyukjinKwon Oct 7, 2016
2badb58
[SPARK-15621][SQL] Support spilling for Python UDF
Oct 7, 2016
97594c2
[SPARK-17761][SQL] Remove MutableRow
hvanhovell Oct 7, 2016
94b24b8
[SPARK-17806] [SQL] fix bug in join key rewritten in HashJoin
Oct 7, 2016
24850c9
[HOTFIX][BUILD] Do not use contains in Option in JdbcRelationProvider
HyukjinKwon Oct 8, 2016
471690f
[MINOR][ML] remove redundant comment in LogisticRegression
wangmiao1981 Oct 8, 2016
362ba4b
[SPARK-17793][WEB UI] Sorting on the description on the Job or Stage …
ajbozarth Oct 8, 2016
4201ddc
[SPARK-17768][CORE] Small (Sum,Count,Mean)Evaluator problems and subo…
srowen Oct 8, 2016
8a6bbe0
[MINOR][SQL] Use resource path for test_script.sh
weiqingy Oct 8, 2016
26fbca4
[SPARK-17832][SQL] TableIdentifier.quotedString creates un-parseable …
jiangxb1987 Oct 10, 2016
1659003
[SPARK-17741][SQL] Grammar to parse top level and nested data fields …
jiangxb1987 Oct 10, 2016
23ddff4
[SPARK-17338][SQL] add global temp view
cloud-fan Oct 10, 2016
7e16c94
[HOT-FIX][SQL][TESTS] Remove unused function in `SparkSqlParserSuite`
jiangxb1987 Oct 10, 2016
4bafaca
[SPARK-17417][CORE] Fix # of partitions for Reliable RDD checkpointing
dhruve Oct 10, 2016
689de92
[SPARK-17830] Annotate spark.sql package with InterfaceStability
rxin Oct 10, 2016
3f8a022
[SPARK-17828][DOCS] Remove unused generate-changelist.py
a-roberts Oct 10, 2016
29f186b
[SPARK-14082][MESOS] Enable GPU support with Mesos
tnachen Oct 10, 2016
03c4020
[SPARK-14610][ML] Remove superfluous split for continuous features in…
sethah Oct 11, 2016
d5ec4a3
[SPARK-17738][TEST] Fix flaky test in ColumnTypeSuite
Oct 11, 2016
90217f9
[SPARK-16896][SQL] Handle duplicated field names in header consistent…
HyukjinKwon Oct 11, 2016
19a5bae
[SPARK-17816][CORE] Fix ConcurrentModificationException issue in Bloc…
seyfe Oct 11, 2016
0c0ad43
[SPARK-17719][SPARK-17776][SQL] Unify and tie up options in a single …
HyukjinKwon Oct 11, 2016
b515768
[SPARK-17844] Simplify DataFrame API for defining frame boundaries in…
rxin Oct 11, 2016
19401a2
[SPARK-15957][ML] RFormula supports forcing to index label
yanboliang Oct 11, 2016
658c714
[SPARK-17808][PYSPARK] Upgraded version of Pyrolite to 4.13
BryanCutler Oct 11, 2016
7388ad9
[SPARK-17338][SQL][FOLLOW-UP] add global temp view
cloud-fan Oct 11, 2016
3694ba4
[SPARK-17864][SQL] Mark data type APIs as stable (not DeveloperApi)
rxin Oct 11, 2016
c8c0906
[SPARK-17821][SQL] Support And and Or in Expression Canonicalize
viirya Oct 11, 2016
75b9e35
[SPARK-17346][SQL][TESTS] Fix the flaky topic deletion in KafkaSource…
zsxwing Oct 11, 2016
07508bd
[SPARK-17817][PYSPARK] PySpark RDD Repartitioning Results in Highly S…
viirya Oct 11, 2016
23405f3
[SPARK-15153][ML][SPARKR] Fix SparkR spark.naiveBayes error when labe…
yanboliang Oct 11, 2016
5b77e66
[SPARK-17387][PYSPARK] Creating SparkContext() from python without sp…
zjffdu Oct 11, 2016
b9a1471
[SPARK-17720][SQL] introduce static SQL conf
cloud-fan Oct 12, 2016
299eb04
Fix hadoop.version in building-spark.md
apivovarov Oct 12, 2016
b512f04
[SPARK-17880][DOC] The url linking to `AccumulatorV2` in the document…
sarutak Oct 12, 2016
c264ef9
[SPARK-17853][STREAMING][KAFKA][DOC] make it clear that reusing group…
koeninger Oct 12, 2016
8d33e1e
[SPARK-11560][MLLIB] Optimize KMeans implementation / remove 'runs'
srowen Oct 12, 2016
8880fd1
[SPARK-14761][SQL] Reject invalid join methods when join columns are …
Oct 12, 2016
d5580eb
[SPARK-17884][SQL] To resolve Null pointer exception when casting fro…
priyankagar Oct 12, 2016
5cc503f
[SPARK-17790][SPARKR] Support for parallelizing R data.frame larger t…
falaki Oct 12, 2016
f8062b6
[SPARK-17840][DOCS] Add some pointers for wiki/CONTRIBUTING.md in REA…
srowen Oct 12, 2016
eb69335
[BUILD] Closing stale PRs
srowen Oct 12, 2016
47776e7
[SPARK-17850][CORE] Add a flag to ignore corrupt files
zsxwing Oct 12, 2016
9ce7d3e
[SPARK-17675][CORE] Expand Blacklist for TaskSets
squito Oct 12, 2016
f9a56a1
[SPARK-17782][STREAMING][KAFKA] alternative eliminate race condition …
koeninger Oct 12, 2016
6f20a92
[SPARK-17845] [SQL] More self-evident window function frame boundary API
rxin Oct 12, 2016
0d4a695
[SPARK-17745][ML][PYSPARK] update NB python api - add weight col para…
WeichenXu123 Oct 13, 2016
21cb59f
[SPARK-17835][ML][MLLIB] Optimize NaiveBayes mllib wrapper to elimina…
yanboliang Oct 13, 2016
edeb51a
[SPARK-17876] Write StructuredStreaming WAL to a stream instead of ma…
brkyvz Oct 13, 2016
064d665
[SPARK-17866][SPARK-17867][SQL] Fix Dataset.dropduplicates
viirya Oct 13, 2016
7222a25
minor doc fix for Row.scala
david-weiluo-ren Oct 13, 2016
6f2fa6c
[SPARK-11272][WEB UI] Add support for downloading event logs from His…
ajbozarth Oct 13, 2016
db8784f
[SPARK-17899][SQL] add a debug mode to keep raw table properties in H…
cloud-fan Oct 13, 2016
7bf8a40
[SPARK-17686][CORE] Support printing out scala and java version with …
jerryshao Oct 13, 2016
0a8e51a
[SPARK-17657][SQL] Disallow Users to Change Table Type
gatorsmile Oct 13, 2016
04d417a
[SPARK-17830][SQL] Annotate remaining SQL APIs with InterfaceStability
rxin Oct 13, 2016
84f149e
[SPARK-17827][SQL] maxColLength type should be Int for String and Binary
robbinspg Oct 13, 2016
08eac35
[SPARK-17834][SQL] Fetch the earliest offsets manually in KafkaSource…
zsxwing Oct 13, 2016
7106866
[SPARK-17731][SQL][STREAMING] Metrics for structured streaming
tdas Oct 13, 2016
adc1124
[SPARK-17661][SQL] Consolidate various listLeafFiles implementations
petermaxlee Oct 13, 2016
9dc0ca0
[SPARK-17368][SQL] Add support for value class serialization and dese…
jodersky Oct 14, 2016
44cbb61
[SPARK-15957][FOLLOW-UP][ML][PYSPARK] Add Python API for RFormula for…
yanboliang Oct 14, 2016
8543996
[SPARK-17927][SQL] Remove dead code in WriterContainer.
rxin Oct 14, 2016
6c29b3d
[SPARK-17925][SQL] Break fileSourceInterfaces.scala into multiple pieces
rxin Oct 14, 2016
2fb12b0
[SPARK-17903][SQL] MetastoreRelation should talk to external catalog …
cloud-fan Oct 14, 2016
1db8fea
[SPARK-15402][ML][PYSPARK] PySpark ml.evaluation should support save/…
yanboliang Oct 14, 2016
a1b136d
[SPARK-14634][ML] Add BisectingKMeansSummary
zhengruifeng Oct 14, 2016
c8b612d
[SPARK-17870][MLLIB][ML] Change statistic to pValue for SelectKBest a…
Oct 14, 2016
28b645b
[SPARK-17855][CORE] Remove query string from jar url
invkrh Oct 14, 2016
7486442
[SPARK-17073][SQL][FOLLOWUP] generate column-level statistics
Oct 14, 2016
a0ebcb3
[DOC] Fix typo in sql hive doc
dhruve Oct 14, 2016
fa37877
Typo: form -> from
ash211 Oct 14, 2016
05800b4
[TEST] Ignore flaky test in StreamingQueryListenerSuite
tdas Oct 14, 2016
de1c1ca
[SPARK-17941][ML][TEST] Logistic regression tests should use sample w…
sethah Oct 14, 2016
7ab8624
[SPARK-17620][SQL] Determine Serde by hive.default.fileformat when Cr…
dilipbiswal Oct 14, 2016
522dd0d
Revert "[SPARK-17620][SQL] Determine Serde by hive.default.fileformat…
yhuai Oct 14, 2016
da9aeb0
[SPARK-17863][SQL] should not add column into Distinct
Oct 14, 2016
5aeb738
[SPARK-16063][SQL] Add storageLevel to Dataset
Oct 14, 2016
f00df40
[SPARK-11775][PYSPARK][SQL] Allow PySpark to register Java UDF
zjffdu Oct 14, 2016
72adfbf
[SPARK-17900][SQL] Graduate a list of Spark SQL APIs to stable
rxin Oct 14, 2016
2d96d35
[SPARK-17946][PYSPARK] Python crossJoin API similar to Scala
srinathshankar Oct 15, 2016
6ce1b67
[SPARK-16980][SQL] Load only catalog table partition metadata require…
Oct 15, 2016
36d81c2
[SPARK-17953][DOCUMENTATION] Fix typo in SparkSession scaladoc
tae-jun Oct 15, 2016
ed14633
[SPARK-17637][SCHEDULER] Packed scheduling for Spark tasks across exe…
Oct 16, 2016
72a6e7a
Revert "[SPARK-17637][SCHEDULER] Packed scheduling for Spark tasks ac…
rxin Oct 16, 2016
59e3eb5
[SPARK-17819][SQL] Support default database in connection URIs for Sp…
dongjoon-hyun Oct 17, 2016
e18d02c
[SPARK-17947][SQL] Add Doc and Comment about spark.sql.debug
gatorsmile Oct 17, 2016
56b0f5f
[MINOR][SQL] Add prettyName for current_database function
weiqingy Oct 17, 2016
e3bf37f
Fix example of tf_idf with minDocFreq
maximerihouey Oct 17, 2016
c7ac027
[SPARK-17839][CORE] Use Nio's directbuffer instead of BufferedInputSt…
Oct 17, 2016
d88a1ba
[SPARK-17751][SQL] Remove spark.sql.eagerAnalysis and Output the Plan…
gatorsmile Oct 17, 2016
813ab5e
[SPARK-17620][SQL] Determine Serde by hive.default.fileformat when Cr…
dilipbiswal Oct 18, 2016
8daa1a2
[SPARK-17974] Refactor FileCatalog classes to simplify the inheritanc…
ericl Oct 18, 2016
1c5a7d7
Revert "[SPARK-17974] Refactor FileCatalog classes to simplify the in…
rxin Oct 18, 2016
7d878cf
[SQL][STREAMING][TEST] Fix flaky tests in StreamingQueryListenerSuite
lw-lin Oct 18, 2016
a9e79a4
[SQL][STREAMING][TEST] Follow up to remove Option.contains for Scala …
tdas Oct 18, 2016
e59df62
[SPARK-17899][SQL][FOLLOW-UP] debug mode should work for corrupted table
cloud-fan Oct 18, 2016
3768653
[SPARK-17388] [SQL] Support for inferring type date/timestamp/decimal…
HyukjinKwon Oct 18, 2016
231f39e
[SPARK-17711] Compress rolled executor log
loneknightpy Oct 18, 2016
4ef39c2
[SPARK-17974] try 2) Refactor FileCatalog classes to simplify the inh…
ericl Oct 18, 2016
bfe7885
[SPARK-17985][CORE] Bump commons-lang3 version to 3.5.
ueshin Oct 18, 2016
20dd110
[MINOR][DOC] Add more built-in sources in sql-programming-guide.md
weiqingy Oct 18, 2016
4518642
[SPARK-17930][CORE] The SerializerInstance instance used when deseria…
witgo Oct 18, 2016
b3130c7
[SPARK-17955][SQL] Make DataFrameReader.jdbc call DataFrameReader.for…
HyukjinKwon Oct 18, 2016
cd662bc
Revert "[SPARK-17985][CORE] Bump commons-lang3 version to 3.5."
rxin Oct 18, 2016
cd106b0
[SPARK-17841][STREAMING][KAFKA] drain commitQueue
koeninger Oct 18, 2016
1e35e96
[SPARK-17817] [PYSPARK] [FOLLOWUP] PySpark RDD Repartitioning Results…
viirya Oct 18, 2016
941b3f9
[SPARK-17731][SQL][STREAMING][FOLLOWUP] Refactored StreamingQueryList…
tdas Oct 19, 2016
5f20ae0
[SPARK-17980][SQL] Fix refreshByPath for converted Hive tables
ericl Oct 19, 2016
2629cd7
[SPARK-17711][TEST-HADOOP2.2] Fix hadoop2.2 compilation error
loneknightpy Oct 19, 2016
4329c5c
[SPARK-17873][SQL] ALTER TABLE RENAME TO should allow users to specif…
cloud-fan Oct 19, 2016
f39852e
[SPARK-18001][DOCUMENT] fix broke link to SparkDataFrame
Wenpei Oct 19, 2016
9540357
[SPARK-17985][CORE] Bump commons-lang3 version to 3.5.
ueshin Oct 19, 2016
444c2d2
[SPARK-10541][WEB UI] Allow ApplicationHistoryProviders to provide th…
ajbozarth Oct 19, 2016
4b2011e
[SPARK-17989][SQL] Check ascendingOrder type in sort_array function r…
HyukjinKwon Oct 20, 2016
f313117
[SPARK-18012][SQL] Simplify WriterContainer
rxin Oct 20, 2016
3975516
[SPARK-18003][SPARK CORE] Fix bug of RDD zipWithIndex & zipWithUnique…
WeichenXu123 Oct 20, 2016
4bd17c4
[SPARK-17991][SQL] Enable metastore partition pruning by default.
ericl Oct 20, 2016
c2c107a
[SPARK-11653][DEPLOY] Allow spark-daemon.sh to run in the foreground
mikejihbe Oct 20, 2016
986a3b8
[SPARK-17796][SQL] Support wildcard character in filename for LOAD DA…
dongjoon-hyun Oct 20, 2016
e895bc2
[SPARK-17860][SQL] SHOW COLUMN's database conflict check should respe…
dilipbiswal Oct 20, 2016
fb0894b
[SPARK-17698][SQL] Join predicates should not contain filter clauses
tejasapatil Oct 20, 2016
84b245f
[SPARK-15780][SQL] Support mapValues on KeyValueGroupedDataset
koertkuipers Oct 20, 2016
947f4f2
[SPARK-17999][KAFKA][SQL] Add getPreferredLocations for KafkaSourceRDD
jerryshao Oct 20, 2016
7f9ec19
[SPARK-18021][SQL] Refactor file name specification for data sources
rxin Oct 20, 2016
2d14ab7
[DOCS] Update docs to not suggest to package Spark before running tests.
markgrover Oct 20, 2016
1bb99c4
[SPARK-18030][TESTS] Adds more checks to collect more info about File…
zsxwing Oct 21, 2016
3180272
[SPARKR] fix warnings
felixcheung Oct 21, 2016
57e97fc
[SPARK-18029][SQL] PruneFileSourcePartitions should not change the ou…
cloud-fan Oct 21, 2016
595893d
[SPARK-17960][PYSPARK][UPGRADE TO PY4J 0.10.4]
jagadeesanas2 Oct 21, 2016
a8ea4da
[SPARK-17331][FOLLOWUP][ML][CORE] Avoid allocating 0-length arrays
zhengruifeng Oct 21, 2016
3a23751
[SPARK-13275][WEB UI] Visually clarified executors start time in time…
ajbozarth Oct 21, 2016
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
The diff you're trying to view is too large. We only load the first 3000 changed files.
10 changes: 10 additions & 0 deletions .github/PULL_REQUEST_TEMPLATE
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
## What changes were proposed in this pull request?

(Please fill in changes proposed in this fix)

## How was this patch tested?

(Please explain how this patch was tested. E.g. unit tests, integration tests, manual tests)
(If this patch involves UI changes, please attach a screenshot; otherwise, remove this)

Please review https://cwiki.apache.org/confluence/display/SPARK/Contributing+to+Spark before opening a pull request.
105 changes: 56 additions & 49 deletions .gitignore
Original file line number Diff line number Diff line change
@@ -1,80 +1,87 @@
*~
*.#*
*#*#
*.swp
*.ipr
*.#*
*.iml
*.ipr
*.iws
*.pyc
*.pyo
*.swp
*~
.DS_Store
.cache
.classpath
.ensime
.ensime_cache/
.ensime_lucene
.generated-mima*
.idea/
.idea_modules/
build/*.jar
.project
.pydevproject
.scala_dependencies
.settings
.cache
cache
.generated-mima*
work/
out/
.DS_Store
third_party/libmesos.so
third_party/libmesos.dylib
/lib/
R-unit-tests.log
R/unit-tests.out
R/cran-check.out
R/pkg/vignettes/sparkr-vignettes.html
build/*.jar
build/apache-maven*
build/zinc*
build/scala*
conf/java-opts
conf/*.sh
build/zinc*
cache
checkpoint
conf/*.cmd
conf/*.properties
conf/*.conf
conf/*.properties
conf/*.sh
conf/*.xml
conf/java-opts
conf/slaves
dependency-reduced-pom.xml
derby.log
dev/create-release/*final
dev/create-release/*txt
dist/
docs/_site
docs/api
target/
reports/
.project
.classpath
.scala_dependencies
lib_managed/
src_managed/
lint-r-report.log
log/
logs/
out/
project/boot/
project/plugins/project/build.properties
project/build/target/
project/plugins/target/
project/plugins/lib_managed/
project/plugins/project/build.properties
project/plugins/src_managed/
logs/
log/
project/plugins/target/
python/lib/pyspark.zip
reports/
scalastyle-on-compile.generated.xml
scalastyle-output.xml
scalastyle.txt
spark-*-bin-*.tgz
spark-tests.log
src_managed/
streaming-tests.log
dependency-reduced-pom.xml
.ensime
.ensime_cache/
.ensime_lucene
checkpoint
derby.log
dist/
dev/create-release/*txt
dev/create-release/*final
spark-*-bin-*.tgz
target/
unit-tests.log
/lib/
rat-results.txt
scalastyle.txt
scalastyle-output.xml
R-unit-tests.log
R/unit-tests.out
python/lib/pyspark.zip
lint-r-report.log
work/

# For Hive
metastore_db/
metastore/
warehouse/
TempStatsStore/
metastore/
metastore_db/
sql/hive-thriftserver/test_warehouses
warehouse/
spark-warehouse/

# For R session data
.RHistory
.RData
.RHistory
.Rhistory
*.Rproj
*.Rproj.*

.Rproj.user
88 changes: 0 additions & 88 deletions .rat-excludes

This file was deleted.

51 changes: 51 additions & 0 deletions .travis.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,51 @@
# Licensed to the Apache Software Foundation (ASF) under one or more
# contributor license agreements. See the NOTICE file distributed with
# this work for additional information regarding copyright ownership.
# The ASF licenses this file to You under the Apache License, Version 2.0
# (the "License"); you may not use this file except in compliance with
# the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

# Spark provides this Travis CI configuration file to help contributors
# check Scala/Java style conformance and JDK7/8 compilation easily
# during their preparing pull requests.
# - Scalastyle is executed during `maven install` implicitly.
# - Java Checkstyle is executed by `lint-java`.
# See the related discussion here.
# https://github.com/apache/spark/pull/12980

# 1. Choose OS (Ubuntu 14.04.3 LTS Server Edition 64bit, ~2 CORE, 7.5GB RAM)
sudo: required
dist: trusty

# 2. Choose language and target JDKs for parallel builds.
language: java
jdk:
- oraclejdk7
- oraclejdk8

# 3. Setup cache directory for SBT and Maven.
cache:
directories:
- $HOME/.sbt
- $HOME/.m2

# 4. Turn off notifications.
notifications:
email: false

# 5. Run maven install before running lint-java.
install:
- export MAVEN_SKIP_RC=1
- build/mvn -T 4 -q -DskipTests -Pmesos -Pyarn -Phadoop-2.3 -Pkinesis-asl -Phive -Phive-thriftserver install

# 6. Run lint-java.
script:
- dev/lint-java
2 changes: 1 addition & 1 deletion CONTRIBUTING.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@ It lists steps that are required before creating a PR. In particular, consider:

- Is the change important and ready enough to ask the community to spend time reviewing?
- Have you searched for existing, related JIRAs and pull requests?
- Is this a new feature that can stand alone as a package on http://spark-packages.org ?
- Is this a new feature that can stand alone as a [third party project](https://cwiki.apache.org/confluence/display/SPARK/Third+Party+Projects) ?
- Is the change being proposed clearly explained and motivated?

When you contribute code, you affirm that the contribution is your original work and that you
Expand Down
34 changes: 20 additions & 14 deletions LICENSE
Original file line number Diff line number Diff line change
Expand Up @@ -237,8 +237,8 @@ The text of each license is also included at licenses/LICENSE-[project].txt.

(BSD 3 Clause) netlib core (com.github.fommil.netlib:core:1.1.2 - https://github.com/fommil/netlib-java/core)
(BSD 3 Clause) JPMML-Model (org.jpmml:pmml-model:1.2.7 - https://github.com/jpmml/jpmml-model)
(BSD 3-clause style license) jblas (org.jblas:jblas:1.2.4 - http://jblas.org/)
(BSD License) AntLR Parser Generator (antlr:antlr:2.7.7 - http://www.antlr.org/)
(BSD License) ANTLR 4.5.2-1 (org.antlr:antlr4:4.5.2-1 - http://wwww.antlr.org/)
(BSD licence) ANTLR ST4 4.0.4 (org.antlr:ST4:4.0.4 - http://www.stringtemplate.org)
(BSD licence) ANTLR StringTemplate (org.antlr:stringtemplate:3.2.1 - http://www.stringtemplate.org)
(BSD License) Javolution (javolution:javolution:5.5.1 - http://javolution.org)
Expand All @@ -249,22 +249,21 @@ The text of each license is also included at licenses/LICENSE-[project].txt.
(Interpreter classes (all .scala files in repl/src/main/scala
except for Main.Scala, SparkHelper.scala and ExecutorClassLoader.scala),
and for SerializableMapWrapper in JavaUtils.scala)
(BSD-like) Scala Actors library (org.scala-lang:scala-actors:2.10.5 - http://www.scala-lang.org/)
(BSD-like) Scala Compiler (org.scala-lang:scala-compiler:2.10.5 - http://www.scala-lang.org/)
(BSD-like) Scala Compiler (org.scala-lang:scala-reflect:2.10.5 - http://www.scala-lang.org/)
(BSD-like) Scala Library (org.scala-lang:scala-library:2.10.5 - http://www.scala-lang.org/)
(BSD-like) Scalap (org.scala-lang:scalap:2.10.5 - http://www.scala-lang.org/)
(BSD-style) scalacheck (org.scalacheck:scalacheck_2.10:1.10.0 - http://www.scalacheck.org)
(BSD-style) spire (org.spire-math:spire_2.10:0.7.1 - http://spire-math.org)
(BSD-style) spire-macros (org.spire-math:spire-macros_2.10:0.7.1 - http://spire-math.org)
(New BSD License) Kryo (com.esotericsoftware.kryo:kryo:2.21 - http://code.google.com/p/kryo/)
(New BSD License) MinLog (com.esotericsoftware.minlog:minlog:1.2 - http://code.google.com/p/minlog/)
(New BSD License) ReflectASM (com.esotericsoftware.reflectasm:reflectasm:1.07 - http://code.google.com/p/reflectasm/)
(BSD-like) Scala Actors library (org.scala-lang:scala-actors:2.11.7 - http://www.scala-lang.org/)
(BSD-like) Scala Compiler (org.scala-lang:scala-compiler:2.11.7 - http://www.scala-lang.org/)
(BSD-like) Scala Compiler (org.scala-lang:scala-reflect:2.11.7 - http://www.scala-lang.org/)
(BSD-like) Scala Library (org.scala-lang:scala-library:2.11.7 - http://www.scala-lang.org/)
(BSD-like) Scalap (org.scala-lang:scalap:2.11.7 - http://www.scala-lang.org/)
(BSD-style) scalacheck (org.scalacheck:scalacheck_2.11:1.10.0 - http://www.scalacheck.org)
(BSD-style) spire (org.spire-math:spire_2.11:0.7.1 - http://spire-math.org)
(BSD-style) spire-macros (org.spire-math:spire-macros_2.11:0.7.1 - http://spire-math.org)
(New BSD License) Kryo (com.esotericsoftware:kryo:3.0.3 - https://github.com/EsotericSoftware/kryo)
(New BSD License) MinLog (com.esotericsoftware:minlog:1.3.0 - https://github.com/EsotericSoftware/minlog)
(New BSD license) Protocol Buffer Java API (com.google.protobuf:protobuf-java:2.5.0 - http://code.google.com/p/protobuf)
(New BSD license) Protocol Buffer Java API (org.spark-project.protobuf:protobuf-java:2.4.1-shaded - http://code.google.com/p/protobuf)
(The BSD License) Fortran to Java ARPACK (net.sourceforge.f2j:arpack_combined_all:0.1 - http://f2j.sourceforge.net)
(The BSD License) xmlenc Library (xmlenc:xmlenc:0.52 - http://xmlenc.sourceforge.net)
(The New BSD License) Py4J (net.sf.py4j:py4j:0.9.1 - http://py4j.sourceforge.net/)
(The New BSD License) Py4J (net.sf.py4j:py4j:0.10.4 - http://py4j.sourceforge.net/)
(Two-clause BSD-style license) JUnit-Interface (com.novocode:junit-interface:0.10 - https://github.com/szeiger/junit-interface/)
(BSD licence) sbt and sbt-launch-lib.bash
(BSD 3 Clause) d3.min.js (https://github.com/mbostock/d3/blob/master/LICENSE)
Expand All @@ -283,11 +282,18 @@ The text of each license is also included at licenses/LICENSE-[project].txt.
(MIT License) SLF4J API Module (org.slf4j:slf4j-api:1.7.5 - http://www.slf4j.org)
(MIT License) SLF4J LOG4J-12 Binding (org.slf4j:slf4j-log4j12:1.7.5 - http://www.slf4j.org)
(MIT License) pyrolite (org.spark-project:pyrolite:2.0.1 - http://pythonhosted.org/Pyro4/)
(MIT License) scopt (com.github.scopt:scopt_2.10:3.2.0 - https://github.com/scopt/scopt)
(MIT License) scopt (com.github.scopt:scopt_2.11:3.2.0 - https://github.com/scopt/scopt)
(The MIT License) Mockito (org.mockito:mockito-core:1.9.5 - http://www.mockito.org)
(MIT License) jquery (https://jquery.org/license/)
(MIT License) AnchorJS (https://github.com/bryanbraun/anchorjs)
(MIT License) graphlib-dot (https://github.com/cpettitt/graphlib-dot)
(MIT License) dagre-d3 (https://github.com/cpettitt/dagre-d3)
(MIT License) sorttable (https://github.com/stuartlangridge/sorttable)
(MIT License) boto (https://github.com/boto/boto/blob/develop/LICENSE)
(MIT License) datatables (http://datatables.net/license)
(MIT License) mustache (https://github.com/mustache/mustache/blob/master/LICENSE)
(MIT License) cookies (http://code.google.com/p/cookies/wiki/License)
(MIT License) blockUI (http://jquery.malsup.com/block/)
(MIT License) RowsGroup (http://datatables.net/license/mit)
(MIT License) jsonFormatter (http://www.jqueryscript.net/other/jQuery-Plugin-For-Pretty-JSON-Formatting-jsonFormatter.html)
(MIT License) modernizr (https://github.com/Modernizr/Modernizr/blob/master/LICENSE)
Loading