Multi model deployment #208

TosinSeg · 2023-06-27T22:16:03Z

No description provided.

mii/deployment.py

mii/config.py

mii/models/score/score_template.py

examples/multi_model/query.py

mii/client.py

mii/grpc_related/modelresponse_server.py

mii/grpc_related/proto/modelresponse.proto

mii/models/score/score_template.py

mii/server.py

TosinSeg · 2023-07-11T22:56:56Z

mii/client.py

+    deployments = []
+    configs = mii.utils.import_score_file(deployment_tag).configs
+    for deployment in configs:
+        if not isinstance(configs[deployment], dict):


When the data is written to the score file it stores the dictionaries of all the deployments, along with the load balancer, model_path, and deployment_tag. The 'deployment' on line 18 in the for loop looks at all of them. So I determine if it is a model by checking if it is a dictionary

mii/models/score/generate.py

mii/client.py

TosinSeg · 2023-07-26T22:44:55Z

mii/deployment.py

+    if not deployments and not all((model, task, deployment_name)):
+        assert deployment_tag is not None, "Deployment tag must be set when starting empty deployment"
+        create_score_file(deployment_tag=deployment_tag,
+                          deployment_type=deployment_type,
+                          deployments=None,
+                          model_path=model_path,
+                          port_map=None,
+                          lb_config=None)
+        return None


I changed it and made a new function, let me know if you think it is better

TosinSeg · 2023-07-26T22:52:41Z

mii/config.py

+    mii_config: MIIConfig = MIIConfig.parse_obj({})
+    ds_config: dict = None
+    version: int = 1
+    deployed: bool = False


Removed the deployed option or make it hidden _deployed(verify this is hidden)

mrwyattii · 2023-08-01T20:44:51Z

examples/multi_model/deploy.py

+                         model=name,
+                         deployment_name=name + "_deployment",
+                         GPU_index_map=gpu_index_map3,
+                         mii_configs=mii.config.MIIConfig(**mii_configs1)))


Are we not able to pass just the dictionary here?

mii/client.py

mrwyattii · 2023-08-01T20:54:53Z

mii/client.py

+    deployments, lb_config, model_path, port_map = _get_deployment_configs(deployment_tag)
+    mii_configs = None
+    if len(deployments) > 0:
+        mii_configs = getattr(next(iter(deployments.values())),


We really need to address this problem. MIIConfig class is required for each DeploymentConfig but the MIIConfig contains options that affect all models (e.g., restful API port). I believe we need to refactor the configs a bit to resolve this and have model-specific configs separate from deployment-specific configs.

mrwyattii · 2023-08-01T20:57:04Z

mii/client.py

+            assert len(self.deployments) == 1, "Must pass deployment_name to query when using multiple deployments"
+            deployment = next(iter(self.deployments.values()))


If self.deployments has len of 1, then why do we need to do next(iter()) on it?

mrwyattii · 2023-08-01T21:18:18Z

mii/config.py

+
+
+class DeploymentConfig(BaseModel):
+    deployment_name: str = Field(alias="DEPLOYMENT_NAME_KEY")


I'm a little bit confused about these aliases. I know we discussed a change here last week, but I think there may have been some miscommunication.

mrwyattii · 2023-08-01T21:26:22Z

mii/client.py

+                   task=None,
+                   model=None,
+                   deployment_name=None,
+                   enable_deepspeed=True,
+                   enable_zero=False,
+                   ds_config=None,
+                   mii_config={},
+                   deployments=[],
+                   deployment_type=DeploymentType.LOCAL,
+                   model_path=None,
+                   version=1):


Since this is a new interface, we do not need to support both the old and new way of adding models. Let's just allow passing of deployments and not task, model, deployment_name

mrwyattii · 2023-08-01T21:27:34Z

mii/client.py

+    async def add_models_async(self, proto_request):
+        await getattr(self.lb_stub, "AddDeployment")(proto_request)
+
+    def add_models(self,


I'm a bit confused about the implementation of this method. It looks like we are re-generating the score file when we add models. And then we call init again? Does this not cause problems for models that are already running?

mii/client.py

Co-authored-by: Michael Wyatt <[email protected]>

TosinSeg added 18 commits June 19, 2023 23:59

Removing load balancing config

4eac006

Reformatting tests

c68e999

Fixed the formatting

5ce1a92

Removed print statement

fa10e19

Merging main

f9cbd74

Removing unused import

8970f4e

Fixing tests

517bea8

Fixing merge issue

58dd2b2

Creating hostfile when one is not provided

bb0d551

Merge branch 'main' into Always_enable_load_balancing

e2bb9d5

Fixing import statements removed by merge

3823534

Removing load_balancing check

6f9b4ad

Removing redudant definitions

499b9ad

Removing hostfile from test

5419ef6

Removing hostfile from non-persistent test

a70b6de

initial changes

eea658b

Merge branch 'main' into multi-model-deployment

20f0878

Maintaining current behavior

c21c31b

mrwyattii reviewed Jun 28, 2023

View reviewed changes

mii/deployment.py Outdated Show resolved Hide resolved

mrwyattii reviewed Jun 28, 2023

View reviewed changes

mii/config.py Outdated Show resolved Hide resolved

mrwyattii reviewed Jun 28, 2023

View reviewed changes

mii/models/score/score_template.py Outdated Show resolved Hide resolved

TosinSeg added 9 commits June 28, 2023 19:04

Reading from score file

f525329

fixing syntax errors

3c0937f

Fixing more syntax errors

156ac83

Fixing more syntax issues

38e270e

initial lb changes

4d4e0d8

Merge branch 'main' into multi-model-deployment

01c8e59

More load balancing changes

f801b36

LB changes and syntax

fd4e2ed

Refactor client, and unpack request in load balancer

0a3b7e5

Tosin Segun added 10 commits July 21, 2023 01:02

More partial deploy updates

c2636b7

Partial deploy started

189e75c

fixing add deploy api queries

adee843

Support for empty deployment 'group'

a145be5

Support for empty deployment 'group'

082c05e

Partial Termination

3ce77d2

Refactoring

b40ecbd

formatting

72dd95c

fixing bug for partial termination

a4e3d56

Removing comments

4b5bb47

mrwyattii reviewed Jul 26, 2023

View reviewed changes

Including GPU index map in score file

30d2b03

mrwyattii reviewed Jul 26, 2023

View reviewed changes

Refactoring deployment

c5d5996

TosinSeg commented Jul 26, 2023

View reviewed changes

Tosin Segun added 8 commits July 26, 2023 23:49

Refactoring and formatting

3ae1781

Refactoring

4b8f02f

Fixing Readme

c51ce37

Refactoring GRPC

43479db

Fixing LB process not terminating

e1b6d23

Adding multi_deployment and partial deploy/terminate unit tests

1675bd8

Removing comments

8684a61

Fixing spelling issues

56a7fce

mrwyattii reviewed Aug 1, 2023

View reviewed changes

TosinSeg and others added 4 commits August 1, 2023 14:41

Update mii/client.py

fb70c3d

Co-authored-by: Michael Wyatt <[email protected]>

Update mii/client.py

e2cfe8a

Co-authored-by: Michael Wyatt <[email protected]>

Removing AML from addDeploy

1312738

Refactoring MIIConfig and DeploymentConfig

b0f0da4

mrwyattii mentioned this pull request Aug 4, 2023

Refactor Configs #218

Merged

Partial deploy/termination example

b78068e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi model deployment #208

Multi model deployment #208

TosinSeg commented Jun 27, 2023

TosinSeg Jul 11, 2023

TosinSeg Jul 26, 2023

TosinSeg Jul 26, 2023

mrwyattii Aug 1, 2023

mrwyattii Aug 1, 2023

mrwyattii Aug 1, 2023

mrwyattii Aug 1, 2023

mrwyattii Aug 1, 2023

mrwyattii Aug 1, 2023

		assert len(self.deployments) == 1, "Must pass deployment_name to query when using multiple deployments"
		deployment = next(iter(self.deployments.values()))



		class DeploymentConfig(BaseModel):
		deployment_name: str = Field(alias="DEPLOYMENT_NAME_KEY")

Multi model deployment #208

Are you sure you want to change the base?

Multi model deployment #208

Conversation

TosinSeg commented Jun 27, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment