MarkDuplicatesSpark fails on because of missing inteldeflater #1960

Closed
akiezun opened this issue Jun 30, 2016 · 3 comments


akiezun commented Jun 30, 2016

gcloud dataproc jobs submit spark --cluster markeddupe --properties spark.kryoserializer.buffer.max=512m,spark.driver.maxResultSize=0,spark.driver.userClassPathFirst=true,spark.io.compression.codec=lzf,spark.yarn.executor.memoryOverhead=600,spark.driver.extraJavaOptions=-Dsamjdk.intel_deflater_so_path=libIntelDeflater.so -Dsamjdk.compression_level=1 -DGATK_STACKTRACE_ON_USER_EXCEPTION=true ,spark.executor.extraJavaOptions=-Dsamjdk.intel_deflater_so_path=libIntelDeflater.so -Dsamjdk.compression_level=1 -DGATK_STACKTRACE_ON_USER_EXCEPTION=true ,spark.driver.memory=8G,spark.executor.cores=3,spark.executor.memory=25G,spark.yarn.executor.memoryOverhead=2500 --files /app/build/libIntelDeflater.so --jar /app/build/libs/gatk-all-4.alpha.1-159-gd0db982-SNAPSHOT-spark.jar MarkDuplicatesSpark --shardedOutput true -O /scratch/tmp.md.bam --numReducers 0 --apiKey <API_KEY> -I gs://broad-gatk-test-jenkins/CEUTrio.HiSeq.WEx.b37.NA12892.readnamesort.bam --sparkMaster yarn-client

Related to #1903.

akiezun added the bug label Jun 30, 2016
akiezun added this to the alpha-2 milestone Jun 30, 2016

akiezun commented Jul 1, 2016

@davidbernick For now, the workaround is to remove -Dsamjdk.intel_deflater_so_path=libIntelDeflater.so (from both the driver and executor Java options) and --files /app/build/libIntelDeflater.so from the command line.

Closing this one. I will open another issue for re-enabling the Intel deflater on Spark.
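As a rough illustration of the workaround above, the two arguments can be stripped from a generated command line with `sed`. This is only a sketch: the function name `strip_intel_deflater` is hypothetical, the command in `cmd` is a shortened stand-in for the full one in the report (the jar name `app.jar` is a placeholder), and it assumes the command is available as a single string.

```shell
# Hypothetical helper (not part of GATK or gatk-launch): remove the
# IntelDeflater arguments from a command line held in a string.
strip_intel_deflater() {
  printf '%s\n' "$1" | sed \
    -e 's|-Dsamjdk\.intel_deflater_so_path=libIntelDeflater\.so *||g' \
    -e 's|--files /app/build/libIntelDeflater\.so *||g'
}

# Shortened stand-in for the full command in the report; app.jar is a placeholder.
cmd='gcloud dataproc jobs submit spark --cluster markeddupe --properties spark.driver.extraJavaOptions=-Dsamjdk.intel_deflater_so_path=libIntelDeflater.so -Dsamjdk.compression_level=1 --files /app/build/libIntelDeflater.so --jar app.jar MarkDuplicatesSpark'

strip_intel_deflater "$cmd"
```

The remaining options, such as -Dsamjdk.compression_level=1, are left untouched; only the two IntelDeflater arguments named in the workaround are removed.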

akiezun closed this as completed Jul 1, 2016

davidbernick commented Jul 1, 2016

I'm running gatk-launch, which generates this command line. Will that be fixed so I can run gatk-launch with Spark jobs?


akiezun commented Jul 5, 2016

Yes, it will be fixed. It should be fixed now (via #1964).

On Fri, Jul 1, 2016 at 3:17 PM, David Bernick wrote:

> I'm running gatk-launch, which generates this command line. Will that be fixed so I can run gatk-launch with Spark jobs?


