
Conversation

@nthanvi nthanvi commented Jul 22, 2016

Changes proposed in this pull request

  1. Added support for dynamic jar loading in Snappy. The idea is to support the existing Snappy command for loading a jar file: the lead node driver and the executor class loaders are modified so that they look for the jar in the snappy-store if it is not found on the regular classpath.
  2. Apart from the above change, there was another problem: once a job is submitted through the job server, if the user changes the jar for that application and resubmits it, the executors do not pick up the new jar. Being long-running, they keep using the old jar that is still on their classpath. To fix this, a new DynamicURLClassLoader implementation maintains one loader per jar.
  3. The current implementations of SnappySQLJob and JavaSnappySQLJob.java are inconsistent in which methods are overridden and what they return.
    Now only two methods are exposed to the user, runSnappyJob and isValidJob, with a single return type SnappyValidation, so the user has a small learning curve and limited entry points.
  4. All existing jobs are modified to use the new method implementations of SnappySQLJob, SnappyStreamingJob, etc.
  5. Code formatting for the existing jobs.
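The per-jar, child-first loading described in points 1 and 2 can be sketched roughly as follows. This is an illustrative, self-contained Java sketch under assumed names, not the PR's actual DynamicURLClassLoader; the class name and structure are stand-ins.

```java
import java.net.URL;
import java.net.URLClassLoader;

// Illustrative sketch (not the PR's DynamicURLClassLoader): one child-first
// URLClassLoader per application jar. Classes are resolved from the jar
// first, so a re-submitted jar shadows stale classes that are still on the
// long-running executor's classpath; unknown classes fall back to the parent.
class ChildFirstJarLoader extends URLClassLoader {

    ChildFirstJarLoader(URL jarUrl, ClassLoader parent) {
        super(new URL[] { jarUrl }, parent);
    }

    @Override
    protected Class<?> loadClass(String name, boolean resolve)
            throws ClassNotFoundException {
        synchronized (getClassLoadingLock(name)) {
            Class<?> c = findLoadedClass(name);
            if (c == null) {
                try {
                    // Child-first: try this jar before delegating upward.
                    c = findClass(name);
                } catch (ClassNotFoundException e) {
                    // Fall back to the regular classpath.
                    c = super.loadClass(name, false);
                }
            }
            if (resolve) {
                resolveClass(c);
            }
            return c;
        }
    }
}
```

A parent-first variant would simply delegate to `super.loadClass` before consulting the jar; the commit messages below note that the PR supports both orderings.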

Patch testing

DUnit, JUnit, and a clean precheckin with store

ReleaseNotes.txt changes

No, but the related *.md files will be updated for the same.

Other PRs

TIBCOSoftware/snappy-spark-jobserver#1
TIBCOSoftware/snappy-spark#39
TIBCOSoftware/snappy-store#88

nthanvi added 14 commits July 13, 2016 13:08
Now users need to implement the isValidJob and runSnappyJob methods for all jobs
Added a contextClassLoader for the driver and client thread; it will enable
reading classes from the snappy-store repository.
It handles an executor class loader per jar file.
Still need to implement a client-first classloader
It supports both childFirst and parentFirst jar loading
Added basic test coverage
Added full JUnit coverage for DynamicURLClassLoader
With JUnit and DUnit coverage
SnappySQLJob can be used for the same purpose

Contributor

This override can also be added for IsolatedClientLoader.classLoader. It is used for any external JDBC URLs apart from our GemXD drivers (e.g. if the user wishes to use it to load data into GemXD). If it is not added there, we need docs on how the user should go about it (perhaps JDBC drivers have to be in SPARK_DIST_CLASSPATH while other parts will be fine with dynamic install-jar).

Contributor Author


I will remove the following line:
private val overwriteFiles = env.conf.getBoolean("spark.files.overwrite", false)

as it is not needed after the changes in the job server. The job server always creates a new jar file with our own suffix added to the name, so the executor knows that the jar belongs to the same job and can reload it. Executors can also copy this jar, since it is a new jar for them.
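The renaming scheme described above might look like the sketch below; the class, method, and suffix format are illustrative assumptions, not the job server's actual code.

```java
// Hypothetical sketch of the jar-renaming idea described above: the job
// server writes each uploaded jar under a fresh name by appending a suffix
// (here, the upload time in milliseconds) before the extension. An executor
// that cached "app.jar" then sees "app-1469180000000.jar" arrive for the
// same job and knows it must load the new jar instead of reusing the old
// classes.
class JarNaming {

    // Derive a versioned file name from the original jar name.
    static String versionedName(String jarName, long uploadTimeMillis) {
        int dot = jarName.lastIndexOf('.');
        if (dot < 0) {
            return jarName + "-" + uploadTimeMillis;
        }
        return jarName.substring(0, dot) + "-" + uploadTimeMillis
                + jarName.substring(dot);
    }
}
```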


sumwale commented Aug 11, 2016

@rishitesh Please review the Java job API changes in this.

public abstract class JavaSnappyStreamingJob implements SparkJobBase {

abstract public Object runJavaJob(JavaSnappyStreamingContext snc, Config jobConfig);
abstract public Object runSnappyJob(JavaSnappyStreamingContext snc, Config jobConfig);
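For context, a job written against the unified two-method contract might look like the self-contained sketch below. The base class and validation types here are stand-ins written for illustration (the PR's description names the single return type SnappyValidation); they are not the PR's exact classes.

```java
import java.util.Map;

// Stand-in validation hierarchy: a single return type for isValidJob.
interface SnappyValidation { }

final class SnappyJobValid implements SnappyValidation { }

final class SnappyJobInvalid implements SnappyValidation {
    final String reason;
    SnappyJobInvalid(String reason) { this.reason = reason; }
}

// The only two entry points a job author implements, per the PR description.
abstract class JavaJobBase<C> {
    abstract Object runSnappyJob(C context, Map<String, String> jobConfig);
    abstract SnappyValidation isValidJob(C context, Map<String, String> jobConfig);
}

// A trivial job: validation checks a required config key before running.
class EchoJob extends JavaJobBase<Object> {
    @Override
    Object runSnappyJob(Object context, Map<String, String> jobConfig) {
        return "ran with input=" + jobConfig.get("input");
    }

    @Override
    SnappyValidation isValidJob(Object context, Map<String, String> jobConfig) {
        return jobConfig.containsKey("input")
                ? new SnappyJobValid()
                : new SnappyJobInvalid("missing required config 'input'");
    }
}
```

Only the context type parameter changes between SQL and streaming jobs, which is the point made below about limited entry points.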

Contributor


Why this rename? I suppose the idea was to emphasize that the code needs to use the Java wrapper JavaSnappyStreamingContext (and not the regular SnappyStreamingContext). @rishitesh your thoughts?

Contributor


It was more to do with type resolution in the case of Scala generic types. The proposed API looks cleaner, provided it passes all the tests, since users only have to deal with two methods; only the input types change.

else new File(tempDir, jarName.format(System.currentTimeMillis()))
TestUtils.createJar(files1 ++ files2, jarFile)
}

Contributor

Since this is used only by tests, move it to io.snappydata.util.TestUtils.


sumwale commented Aug 11, 2016

Haven't looked at the tests yet, but looked through pretty much all of the rest of the code. Please ensure we have tests for:
a) install/replace/drop from the job server (is drop possible from the job server?)
b) install/replace/drop using the usual GemXD procedures
c) install/replace/drop of JDBC drivers

We also need docs on how a user should install/replace/drop jars as well as JDBC drivers, which is a common case when users try to import data into SnappyData. If c) is not possible, then we need to document and test that too (SPARK_DIST_CLASSPATH).


nthanvi commented Aug 24, 2016

Added another pull request, #337, after incorporating the review comments and the 2.0 merge.

For the remaining open issues with this PR, see the JIRA: https://jira.snappydata.io/browse/SNAP-999?filter=-1

@nthanvi nthanvi closed this Aug 24, 2016
@sumwale sumwale deleted the SNAP-293 branch December 5, 2016 22:33