Commit efc9aa8
Usage docs for training recipes (#1630)
* Feature: Support GPU training recipes with Sagemaker Python SDK (#1516)
* v0 estimator for launching kandinsky training
* code cleanup
* option to over-ride git repos for kandinsky for testing purposes
* update dependencies
* update comment
* formatting fixes
* style fixes
* code cleanup
* Add warning messages for ignored arguments
* cleanup, address comments
* fix
* clone launcher repo only if necessary
* add a cleanup method to call after fit
* fix docstring
* fix warning
* cleanup update
* fix
* code style fix
* rename cleanup method for clarity
* missed change
* move cleanup to when object is destroyed
* add unit tests
* formatting fix
* removing tests which don't work as recipe repos are private
* removing tests which don't work as recipe repos are private
* resolve comments
* resolve comments
* Feature: Support Neuron training recipes. (#1526)
* Feature: Resolve recipes correctly before launching (#1529)
* fix to work with launcher recipes
* fix suffix for temp file
* fix path and error message
* fix for recipes from launcher
* resolve recipes correctly
* fix imports
* reformat message to avoid code-doc test issue
* code style fix
* code style fix
* code style fix
* code style fix
* code style fix
* code style fix
* code style fix
* code style fix
* code style fix
* doc formatting
* check if resolver exists before registering
* Feature: Add unit tests for recipes and minor bug fixes. (#1532)
* basic checks and unit test for recipes
* More testing for recipes. Move recipe overrides to top before accessing any recipe fields.
* check that we use customer provided image uri if it is set
* reformat
* test fixes
* update git urls for recipes
* revert to ssh git urls for recipes
* Feature: Move image uris and git repos for training recipes to json (#1547)
* Update MANIFEST.in so that wheel builds correctly (#1563)
* Remove default values for fields in recipe_overrides and fix recipe path. (#1566)
* add optional source dir for recipes, copy training code and requirements to source dir
* diff names for recipe file and local script option
* format and add unit test
* make entry point script and recipe file temp files that can be gced
* formatting and fix
* test fix
* test fixes
* format fix
* break function up because it is too long
* fixes
* fix
* fix
* remove references to launcher and adapter dir as we copy out everything needed into source dir
* reformat
* copy all directory contents for trainium as there is more than one source file
* fix
* fix
* remove debugging message
* Change default source directory to current, add option to specify source dir (#1593)
* update to public uris for hyperpod recipe repos and smp image
* fixes
* remove debug copies
* change caps for env vars
* skip some tests for now
* format
* neuron json for retrieving images
* update training_recipes.json
* add unit test
* reformat
* fix long line
* add source dir check when using training recipe
* adding more regions
* reformat
* doc update
* doc update
* doc update
* doc update
* fix capitalization issues
* fix capitalization issues
* doc check issue
1 parent 36a8f0f commit efc9aa8
File tree
2 files changed, +131 -12 lines changed:
- doc/frameworks/pytorch
- src/sagemaker/pytorch