LIGHT Refactor Phase 1 - New Data Model + Distributed Deploy #291

JackUrb · 2022-05-11T22:10:13Z

Master PR to track progress against the phase 1 of the light refactor, which roughly coincides with parts of #296. These changes are critical for consolidating the LIGHT codebase and having it ready for both research and production use.

Much of this will be handled in follow-up PRs that merge in. The current plan:
New Data Model

(THIS PR's first commit) outline core classes with stubs, define database schemas
Implementing New Users Tables #292 (approved, not merged)
New Episode logging #293 (Episode DB + InteractionLoggers)
New environment db #295 (EnvDB and associated testing)
Using EpisodeDB in main game path #297 (Port main game to write to Interactions DB when new episodes are created)
Using UserDB as main game identity storage #298 (Port main game to use users DB for users)

Distributed deploy

Creating and using the ModelPool #300 (Model Pool completion)
Creating LIGHT's ModelServer #302 (Using remote models)
asyncio all over LIGHT #304 (No longer synchronous)
AWS Option for LIGHT data model storage #305 (Able to launch both local and production now)
Stable Server commit before coming refactors #306 (Small deploy fixes caught on frontend)
Privacy commitment improvements #307 (Approvals-related changes)

Elements that are part of Phase 2/3 that won't be tracked here.

Port graph builders to use EnvironmentDB
Standardize and centralize all of our data, populating these DBs
Update build scripts to pull from the new databases
Deprecate the old LIGHT formats

small intro fix (actor_name)

Pre alpha fixes2

fixed scroll issues in side bar and main page

on_use events are totally broken

add missing missions

More backend and event fixes pre-launch

Stars

Mission entry

* Bump parlai * soften pytest * Test soul messing up testing * Another mis-test * Accidentally dropped file * Stop confusing the tests * another one snuck by

* Filling out UserDB * Abstract enforce get first * Clearer argument - num_turns * Using enums in DB

* Filling out UserDB * Abstract enforce get first * Clearer argument - num_turns * Using enums in DB * Initial episode data model * Update content loggers to use episode formatting * Updating tables to work with testing * Fixing some test changes * Fixing small warnings that were noise during tests * Moving default log path * Test fix * Correcting math thanks to Kurt * Addressing comments, clarifying code

* Filling out UserDB * Abstract enforce get first * Clearer argument - num_turns * Using enums in DB * Initial episode data model * Update content loggers to use episode formatting * Updating tables to work with testing * Fixing some test changes * Fixing small warnings that were noise during tests * Moving default log path * Test fix * Correcting math thanks to Kurt * Updating env DB classes to SQLAlchemy * Name keys and Elems coded * Adding arbitrary node attributes * First complete pass of EnvDB * Mypy fixings * Fixing agents * Writing some tests * Finishing tests for object and room creates and queries * Edge testing * Arbitrary attributes testing * Quests and testing * And finally, DBGraph tests * fixing episode change * TODO function * final mypy fixes * DBID testing * a -> either a or an depending on aeiou

* Filling out UserDB * Abstract enforce get first * Clearer argument - num_turns * Using enums in DB * Initial episode data model * Update content loggers to use episode formatting * Updating tables to work with testing * Fixing some test changes * Fixing small warnings that were noise during tests * Moving default log path * Test fix * Correcting math thanks to Kurt * Updating env DB classes to SQLAlchemy * Name keys and Elems coded * Adding arbitrary node attributes * First complete pass of EnvDB * Mypy fixings * Fixing agents * Writing some tests * Finishing tests for object and room creates and queries * Edge testing * Arbitrary attributes testing * Quests and testing * And finally, DBGraph tests * fixing episode change * TODO function * final mypy fixes * DBID testing * a -> either a or an depending on aeiou * adding WorldConfig to hold complex configuration vars * Moving episode_db into relevant GraphBuilders * Game launches, but not logging * Local BaseDB, now saving episodes * Missing files * deleting miscommit * test fix

* Filling out UserDB * Abstract enforce get first * Clearer argument - num_turns * Using enums in DB * Initial episode data model * Update content loggers to use episode formatting * Updating tables to work with testing * Fixing some test changes * Fixing small warnings that were noise during tests * Moving default log path * Test fix * Correcting math thanks to Kurt * Updating env DB classes to SQLAlchemy * Name keys and Elems coded * Adding arbitrary node attributes * First complete pass of EnvDB * Mypy fixings * Fixing agents * Writing some tests * Finishing tests for object and room creates and queries * Edge testing * Arbitrary attributes testing * Quests and testing * And finally, DBGraph tests * fixing episode change * TODO function * final mypy fixes * DBID testing * a -> either a or an depending on aeiou * adding WorldConfig to hold complex configuration vars * Moving episode_db into relevant GraphBuilders * Game launches, but not logging * Local BaseDB, now saving episodes * Missing files * deleting miscommit * test fix * Migrating to UserDB * No more LIGHTDatabase in TornadoServer * Fixing tests * Works after testing locally * Updated messaging for unimplemented

* Some initial transitions over to model pool * Moving initialization code out from where it occurred * Wiring more of the system together * Adding opt for reranked generative * Filling out UserDB * Abstract enforce get first * Clearer argument - num_turns * Using enums in DB * Initial episode data model * Update content loggers to use episode formatting * Updating tables to work with testing * Fixing some test changes * Fixing small warnings that were noise during tests * Moving default log path * Test fix * Correcting math thanks to Kurt * Updating env DB classes to SQLAlchemy * Name keys and Elems coded * Adding arbitrary node attributes * First complete pass of EnvDB * Mypy fixings * Fixing agents * Writing some tests * Finishing tests for object and room creates and queries * Edge testing * Arbitrary attributes testing * Quests and testing * And finally, DBGraph tests * fixing episode change * TODO function * final mypy fixes * DBID testing * a -> either a or an depending on aeiou * adding WorldConfig to hold complex configuration vars * Moving episode_db into relevant GraphBuilders * Game launches, but not logging * Local BaseDB, now saving episodes * Missing files * deleting miscommit * test fix * Migrating to UserDB * No more LIGHTDatabase in TornadoServer * Fixing tests * Works after testing locally * Updated messaging for unimplemented * Upgrading OneRoomGraphBuilder to ModelPool * Completing (almost) the rest of Modelpool references * Works without loading models in play_map * Model pool actually works * Safety working as well * removing prints * Fixing some tests, skipping starspace * Runs on server too * Enums for model types

…LIGHT into new-data-model

* Some initial transitions over to model pool * Moving initialization code out from where it occurred * Wiring more of the system together * Adding opt for reranked generative * Filling out UserDB * Abstract enforce get first * Clearer argument - num_turns * Using enums in DB * Initial episode data model * Update content loggers to use episode formatting * Updating tables to work with testing * Fixing some test changes * Fixing small warnings that were noise during tests * Moving default log path * Test fix * Correcting math thanks to Kurt * Updating env DB classes to SQLAlchemy * Name keys and Elems coded * Adding arbitrary node attributes * First complete pass of EnvDB * Mypy fixings * Fixing agents * Writing some tests * Finishing tests for object and room creates and queries * Edge testing * Arbitrary attributes testing * Quests and testing * And finally, DBGraph tests * fixing episode change * TODO function * final mypy fixes * DBID testing * a -> either a or an depending on aeiou * adding WorldConfig to hold complex configuration vars * Moving episode_db into relevant GraphBuilders * Game launches, but not logging * Local BaseDB, now saving episodes * Missing files * deleting miscommit * test fix * Migrating to UserDB * No more LIGHTDatabase in TornadoServer * Fixing tests * Works after testing locally * Updated messaging for unimplemented * Upgrading OneRoomGraphBuilder to ModelPool * Completing (almost) the rest of Modelpool references * Works without loading models in play_map * Model pool actually works * Safety working as well * removing prints * Fixing some tests, skipping starspace * Runs on server too * Creating LIGHT's ModelServer * Undo server change * tornado simplicity * Handling for inline candidate models * But regular models should also work without this * Enums for model types

* Some initial transitions over to model pool * Moving initialization code out from where it occurred * Wiring more of the system together * Adding opt for reranked generative * Filling out UserDB * Abstract enforce get first * Clearer argument - num_turns * Using enums in DB * Initial episode data model * Update content loggers to use episode formatting * Updating tables to work with testing * Fixing some test changes * Fixing small warnings that were noise during tests * Moving default log path * Test fix * Correcting math thanks to Kurt * Updating env DB classes to SQLAlchemy * Name keys and Elems coded * Adding arbitrary node attributes * First complete pass of EnvDB * Mypy fixings * Fixing agents * Writing some tests * Finishing tests for object and room creates and queries * Edge testing * Arbitrary attributes testing * Quests and testing * And finally, DBGraph tests * fixing episode change * TODO function * final mypy fixes * DBID testing * a -> either a or an depending on aeiou * adding WorldConfig to hold complex configuration vars * Moving episode_db into relevant GraphBuilders * Game launches, but not logging * Local BaseDB, now saving episodes * Missing files * deleting miscommit * test fix * Migrating to UserDB * No more LIGHTDatabase in TornadoServer * Fixing tests * Works after testing locally * Updated messaging for unimplemented * Upgrading OneRoomGraphBuilder to ModelPool * Completing (almost) the rest of Modelpool references * Works without loading models in play_map * Model pool actually works * Safety working as well * removing prints * Fixing some tests, skipping starspace * Runs on server too * Creating LIGHT's ModelServer * Undo server change * tornado simplicity * Handling for inline candidate models * But regular models should also work without this * Async... all of the things... * Async the server too * clearing up async server tests * Correct async mock * internalize init_world * clean up tornado usage * small GameInstance bug * small GameInstance bug * Some deploy fixes * Moving safety model to async part * Some safety fixes * test fixes * Enums for model types * Checking for non-list to convert first

* Some initial transitions over to model pool * Moving initialization code out from where it occurred * Wiring more of the system together * Adding opt for reranked generative * Filling out UserDB * Abstract enforce get first * Clearer argument - num_turns * Using enums in DB * Initial episode data model * Update content loggers to use episode formatting * Updating tables to work with testing * Fixing some test changes * Fixing small warnings that were noise during tests * Moving default log path * Test fix * Correcting math thanks to Kurt * Updating env DB classes to SQLAlchemy * Name keys and Elems coded * Adding arbitrary node attributes * First complete pass of EnvDB * Mypy fixings * Fixing agents * Writing some tests * Finishing tests for object and room creates and queries * Edge testing * Arbitrary attributes testing * Quests and testing * And finally, DBGraph tests * fixing episode change * TODO function * final mypy fixes * DBID testing * a -> either a or an depending on aeiou * adding WorldConfig to hold complex configuration vars * Moving episode_db into relevant GraphBuilders * Game launches, but not logging * Local BaseDB, now saving episodes * Missing files * deleting miscommit * test fix * Migrating to UserDB * No more LIGHTDatabase in TornadoServer * Fixing tests * Works after testing locally * Updated messaging for unimplemented * Upgrading OneRoomGraphBuilder to ModelPool * Completing (almost) the rest of Modelpool references * Works without loading models in play_map * Model pool actually works * Safety working as well * removing prints * Fixing some tests, skipping starspace * Runs on server too * Creating LIGHT's ModelServer * Undo server change * tornado simplicity * Handling for inline candidate models * But regular models should also work without this * Async... all of the things... * Async the server too * clearing up async server tests * Correct async mock * internalize init_world * clean up tornado usage * small GameInstance bug * small GameInstance bug * Some deploy fixes * now using aws as a storage backend * Moving safety model to async part * Some safety fixes * test fixes * silly elif fix

* Some initial transitions over to model pool * Moving initialization code out from where it occurred * Wiring more of the system together * Adding opt for reranked generative * Filling out UserDB * Abstract enforce get first * Clearer argument - num_turns * Using enums in DB * Initial episode data model * Update content loggers to use episode formatting * Updating tables to work with testing * Fixing some test changes * Fixing small warnings that were noise during tests * Moving default log path * Test fix * Correcting math thanks to Kurt * Updating env DB classes to SQLAlchemy * Name keys and Elems coded * Adding arbitrary node attributes * First complete pass of EnvDB * Mypy fixings * Fixing agents * Writing some tests * Finishing tests for object and room creates and queries * Edge testing * Arbitrary attributes testing * Quests and testing * And finally, DBGraph tests * fixing episode change * TODO function * final mypy fixes * DBID testing * a -> either a or an depending on aeiou * adding WorldConfig to hold complex configuration vars * Moving episode_db into relevant GraphBuilders * Game launches, but not logging * Local BaseDB, now saving episodes * Missing files * deleting miscommit * test fix * Migrating to UserDB * No more LIGHTDatabase in TornadoServer * Fixing tests * Works after testing locally * Updated messaging for unimplemented * Upgrading OneRoomGraphBuilder to ModelPool * Completing (almost) the rest of Modelpool references * Works without loading models in play_map * Model pool actually works * Safety working as well * removing prints * Fixing some tests, skipping starspace * Runs on server too * Creating LIGHT's ModelServer * Undo server change * tornado simplicity * Handling for inline candidate models * But regular models should also work without this * Async... all of the things... * Async the server too * clearing up async server tests * Correct async mock * internalize init_world * clean up tornado usage * small GameInstance bug * small GameInstance bug * Some deploy fixes * now using aws as a storage backend * Moving safety model to async part * Some safety fixes * test fixes * silly elif fix * Taking bug-fixes from stable server * Model server changes too * Skip another web test, works in prod, refactor incoming * Enums for model types * Checking for non-list to convert first * Slightly more clarity

* Some initial transitions over to model pool * Moving initialization code out from where it occurred * Wiring more of the system together * Adding opt for reranked generative * Filling out UserDB * Abstract enforce get first * Clearer argument - num_turns * Using enums in DB * Initial episode data model * Update content loggers to use episode formatting * Updating tables to work with testing * Fixing some test changes * Fixing small warnings that were noise during tests * Moving default log path * Test fix * Correcting math thanks to Kurt * Updating env DB classes to SQLAlchemy * Name keys and Elems coded * Adding arbitrary node attributes * First complete pass of EnvDB * Mypy fixings * Fixing agents * Writing some tests * Finishing tests for object and room creates and queries * Edge testing * Arbitrary attributes testing * Quests and testing * And finally, DBGraph tests * fixing episode change * TODO function * final mypy fixes * DBID testing * a -> either a or an depending on aeiou * adding WorldConfig to hold complex configuration vars * Moving episode_db into relevant GraphBuilders * Game launches, but not logging * Local BaseDB, now saving episodes * Missing files * deleting miscommit * test fix * Migrating to UserDB * No more LIGHTDatabase in TornadoServer * Fixing tests * Works after testing locally * Updated messaging for unimplemented * Upgrading OneRoomGraphBuilder to ModelPool * Completing (almost) the rest of Modelpool references * Works without loading models in play_map * Model pool actually works * Safety working as well * removing prints * Fixing some tests, skipping starspace * Runs on server too * Creating LIGHT's ModelServer * Undo server change * tornado simplicity * Handling for inline candidate models * But regular models should also work without this * Async... all of the things... * Async the server too * clearing up async server tests * Correct async mock * internalize init_world * clean up tornado usage * small GameInstance bug * small GameInstance bug * Some deploy fixes * now using aws as a storage backend * Moving safety model to async part * Some safety fixes * test fixes * silly elif fix * Taking bug-fixes from stable server * Model server changes too * Skip another web test, works in prod, refactor incoming * Ensure we're not using FB user data * Methods for scrubbing the datasets * Deleting player data and related graph info * Fixing bugs, adding tests * Environment exporting * Export episode DB too * 60 != 90 * Addressing comments

…LIGHT into new-data-model

JackUrb and others added 30 commits April 29, 2021 11:38

Merge pull request #202 from facebookresearch/bugz4

87ab8f1

small intro fix (actor_name)

fixed undefined actor error in message component

5a4d048

added checks for props before any use of to upper

0feee62

fixed system message agent header

795044e

Merge pull request #203 from facebookresearch/pre-alpha-fixes2

128038d

Pre alpha fixes2

fixed scroll issues in side bar and main page

612a56a

Merge pull request #204 from facebookresearch/pre-alpha-fixes2

a4e6db2

fixed scroll issues in side bar and main page

add missing missions

bc72898

added star shine animation

8dac513

added star shine animation

32a0639

events are totally broken

9ed557c

Merge pull request #206 from facebookresearch/bugz6

0e1e8a9

on_use events are totally broken

Merge pull request #205 from facebookresearch/bugz5

1c4e996

add missing missions

Fixes and enhancements to use_events

25f7c78

styled soul spawn entry

2f2c3da

Another round of backend fixes

cc25aab

Also fix debug logging?

04e2184

Skip dialogue safety check on DEBUG

357b40a

Usage limit to scrolls

07b77df

Merge pull request #208 from facebookresearch/more-event-fixes

cbbfd51

More backend and event fixes pre-launch

added check for quest completion to reducer

dbf1d7c

added Mission success entry and total exp to progress bar

be06c4c

fixed border on soul spawn event entry

64e1bf8

adding copy to tutorial page

912f47e

added images to experience points topic

c293791

adding images to experience points system topic

2a4f76f

removed glowing effect from character description on soul spawn event

5ec2481

Merge pull request #207 from facebookresearch/stars

830faed

Stars

Merge branch 'master' into mission-entry

9d23f32

Merge pull request #209 from facebookresearch/mission-entry

119d10b

Mission entry

Alex-Gurung added 2 commits June 13, 2022 08:58

updates for whole pipline

cf7a6c6

cleanup, deleted a file

255f1ef

JackUrb mentioned this pull request Jul 22, 2022

New environment db #295

Merged

JackUrb added 8 commits July 28, 2022 16:59

Rebuild on main, committed (#299)

173d21f

Merge branch 'main' into new-data-model

a06a3a9

requirements

d23811e

Fix tests on main (#301)

67bd4ce

* Bump parlai * soften pytest * Test soul messing up testing * Another mis-test * Accidentally dropped file * Stop confusing the tests * another one snuck by

Merge branch 'main' into new-data-model

6577160

Implementing New Users Tables (#292)

183449b

* Filling out UserDB * Abstract enforce get first * Clearer argument - num_turns * Using enums in DB

JackUrb changed the title ~~New LIGHT Data Model~~ LIGHT Refactor Phase 1 - New Data Model + Distributed Deploy Aug 22, 2022

JackUrb added 15 commits August 22, 2022 12:57

Fixing bad merge

c05132b

Merge branch 'new-data-model' of https://github.com/facebookresearch/…

643e1b2

…LIGHT into new-data-model

dropped change on merge

baad3d4

Small fixes for episode db keys

fa347e2

Merge branch 'new-data-model' of https://github.com/facebookresearch/…

a349b1f

…LIGHT into new-data-model

world dissociation

0978216

Small episode test cleanup

bf5b04a

JackUrb closed this Aug 31, 2022

JackUrb force-pushed the main branch from 921f7e5 to d853d19 Compare August 31, 2022 15:34

JackUrb mentioned this pull request Aug 31, 2022

LIGHT Refactor Phase 1 - New Data Model + Distributed Deploy #325

Merged

12 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LIGHT Refactor Phase 1 - New Data Model + Distributed Deploy #291

LIGHT Refactor Phase 1 - New Data Model + Distributed Deploy #291

JackUrb commented May 11, 2022 •

edited

Loading

LIGHT Refactor Phase 1 - New Data Model + Distributed Deploy #291

LIGHT Refactor Phase 1 - New Data Model + Distributed Deploy #291

Conversation

JackUrb commented May 11, 2022 • edited Loading

JackUrb commented May 11, 2022 •

edited

Loading