Commit 0b96d85

Merge pull request apache#399 from pwendell/consolidate-off
Disable shuffle file consolidation by default

After running various performance tests for the 0.9 release, this still seems to have performance issues even on XFS. So let's keep this off-by-default for 0.9 and users can experiment with it depending on their disk configurations.
2 parents 0ab505a + 2802cc8 commit 0b96d85

2 files changed: +2 -2 lines changed

core/src/main/scala/org/apache/spark/storage/ShuffleBlockManager.scala

Lines changed: 1 addition & 1 deletion
@@ -64,7 +64,7 @@ class ShuffleBlockManager(blockManager: BlockManager) {
   // Turning off shuffle file consolidation causes all shuffle Blocks to get their own file.
   // TODO: Remove this once the shuffle file consolidation feature is stable.
   val consolidateShuffleFiles =
-    conf.getBoolean("spark.shuffle.consolidateFiles", true)
+    conf.getBoolean("spark.shuffle.consolidateFiles", false)
 
   private val bufferSize = conf.getInt("spark.shuffle.file.buffer.kb", 100) * 1024
 

docs/configuration.md

Lines changed: 1 addition & 1 deletion
@@ -382,7 +382,7 @@ Apart from these, the following properties are also available, and may be useful
 
 <tr>
   <td>spark.shuffle.consolidateFiles</td>
-  <td>true</td>
+  <td>false</td>
   <td>
     If set to "true", consolidates intermediate files created during a shuffle. Creating fewer files can improve filesystem performance for shuffles with large numbers of reduce tasks. It is recommended to set this to "true" when using ext4 or xfs filesystems. On ext3, this option might degrade performance on machines with many (>8) cores due to filesystem limitations.
   </td>
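
Since consolidation is now off by default, anyone who wants to experiment with it has to opt in explicitly. A minimal sketch of one way to do that, assuming Spark 0.9 or later is on the classpath and the property is set through `SparkConf` (the object name `ConsolidateFilesExample` is hypothetical and not part of this commit):

```scala
import org.apache.spark.SparkConf

// Hypothetical example object; not part of the commit.
object ConsolidateFilesExample {
  def main(args: Array[String]): Unit = {
    // loadDefaults = false skips spark.* system properties,
    // so only the setting made here is visible.
    val conf = new SparkConf(false)
      .set("spark.shuffle.consolidateFiles", "true")

    // Same kind of lookup as in ShuffleBlockManager: the second
    // argument (now false) is used only when the key is unset.
    val consolidate = conf.getBoolean("spark.shuffle.consolidateFiles", false)
    println(s"spark.shuffle.consolidateFiles = $consolidate")
  }
}
```

Whether enabling the option helps is workload- and filesystem-dependent, per the commit message, so it is worth benchmarking on the target disks before turning it on.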

0 commit comments
