This repository has been archived by the owner on May 15, 2019. It is now read-only.

Spot conf parser #142

Open · wants to merge 23 commits into spot
Conversation

natedogs911 (Contributor)

Moving hdfs_setup and ml_ops to Python scripts instead of bash to support the new spot.conf. All variables are now stored in spot.conf, including the ingest configurations.

I have left the original bash scripts in place for comparison and testing this round.
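The single-file configuration described above can be sketched with Python's stdlib ConfigParser. This is a hypothetical illustration, not the PR's actual parser: the key names mirror the snippets quoted later in this thread, but the sample values and section layout are assumptions.

```python
# Hypothetical sketch of parsing the new spot.conf with ConfigParser.
# Key names mirror the PR snippets; the values here are made up.
from configparser import ConfigParser

SAMPLE_CONF = """
[DEFAULT]
SPK_EXEC_CORES = 4
SPK_EXEC_MEM = 8g
TOL = 1e-6
"""

conf = ConfigParser()
conf.read_string(SAMPLE_CONF)  # a real script would call conf.read(path)

# Values come back as strings, much like the old bash variables.
SPK_EXEC_CORES = conf.get('DEFAULT', 'SPK_EXEC_CORES')
SPK_EXEC_MEM = conf.get('DEFAULT', 'SPK_EXEC_MEM')
TOL = conf.get('DEFAULT', 'TOL')
```

Keeping everything under a single section (or `DEFAULT`) means the Python scripts can read the same file the ingest components use, which is the point of consolidating into spot.conf.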

"--conf spark.executor.cores=" + SPK_EXEC_CORES,
"--conf spark.executor.memory=" + SPK_EXEC_MEM,
"--conf spark.driver.maxResultSize=" + SPK_DRIVER_MAX_RESULTS,
"--conf spark.yarn.driver.memoryOverhead=" + SPK_DRIVER_MEM_OVERHEAD,


spark.yarn.driver.memoryOverhead should be spark.yarn.am.memoryOverhead based on current spot branch.

natedogs911 (Contributor, Author)

Done.

SPK_DRIVER_MAX_RESULTS=
SPK_EXEC_CORES=
SPK_DRIVER_MEM_OVERHEAD=
SPK_EXEC_MEM_OVERHEAD=


Thanks, you fixed it.

TOL = conf.get('DEFAULT','TOL')

#prepare options for spark-submit
spark_cmd = [


Some of the values in spark_cmd and spark_extras are either modified or gone in the spot branch.

natedogs911 (Contributor, Author)

I'm working on rebasing my branch right now to resolve this and the conflicts.

natedogs911 (Contributor, Author)

There was only the one change to the actual spark command that I could find. Let me know if there is anything else; pushing the changes now.
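For readers following along, the spark-submit assembly under discussion can be sketched as follows. The values are placeholders, and the split-argument style is one common way to build a subprocess argument list; the PR's actual script may differ.

```python
# Hypothetical sketch of building the spark-submit argument list from the
# parsed conf values. All values here are placeholders, not the PR's defaults.
SPK_EXEC_CORES = '4'
SPK_EXEC_MEM = '8g'
SPK_DRIVER_MAX_RESULTS = '4g'
SPK_DRIVER_MEM_OVERHEAD = '512'

spark_cmd = [
    'spark-submit',
    '--conf', 'spark.executor.cores=' + SPK_EXEC_CORES,
    '--conf', 'spark.executor.memory=' + SPK_EXEC_MEM,
    '--conf', 'spark.driver.maxResultSize=' + SPK_DRIVER_MAX_RESULTS,
    # Note the property fix discussed earlier in this thread: the spot branch
    # uses spark.yarn.am.memoryOverhead, not spark.yarn.driver.memoryOverhead.
    '--conf', 'spark.yarn.am.memoryOverhead=' + SPK_DRIVER_MEM_OVERHEAD,
]

# A real script would then run it, e.g. subprocess.call(spark_cmd)
```

Passing each `--conf` flag and its `key=value` pair as separate list elements keeps the command safe to run through subprocess without shell quoting.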

@natedogs911 (Contributor, Author)

Now that we are reviewing this...

Obviously these vars are not being used; is there any reason to keep them around?

PREPROCESS_STEP = "{0}_pre_lda".format(args.type)
POSTPROCESS_STEP = "{0}_post_lda".format(args.type)

HDFS_DOCRESULTS = "{0}/doc_results.csv".format(HPATH)
LOCAL_DOCRESULTS = "{0}/doc_results.csv".format(LPATH)

HDFS_WORDRESULTS = "{0}/word_results.csv".format(HPATH)
LOCAL_WORDRESULTS = "{0}/word_results.csv".format(LPATH)

LDA_OUTPUT_DIR = "{1}/{1}".format(args.type, args.fdate)

@rabarona

I don't see any reason to keep those.
