Add AOAI O1 Preview specific endpoint #31

ilmarinen · 2024-09-25T01:48:33Z

Add a RestEndpointO1PreviewModelsAzure class that captures the specifics needed for hitting the o1-preview endpoint in Azure.
Add rate limiting logic at the model level.

safooray · 2024-09-26T20:54:28Z

eureka_ml_insights/models/models.py

+    def __post_init__(self):
+        self.bearer_token_provider = get_bearer_token_provider(AzureCliCredential(), "https://cognitiveservices.azure.com/.default")
+
+        @sleep_and_retry


Could you please add this to the parent class so all endpoint models can benefit from it?
It should be optional, though: if the user does not provide any rate-limit-related arguments, the default behavior should be no rate limiting.
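
One way to read this request, as a minimal sketch: the parent class takes optional `calls` and `period` fields and only applies a limiter when both are set. The class names `EndpointModel` and `O1PreviewEndpoint`, the `send` placeholder, and the `RateLimiter` helper are hypothetical; the helper is a stdlib stand-in for `ratelimit`'s `limits` plus `sleep_and_retry`, not the PR's actual code.

```python
import time
from collections import deque
from dataclasses import dataclass
from typing import Optional


class RateLimiter:
    """Blocks until a call is allowed: at most `calls` per `period` seconds."""

    def __init__(self, calls: int, period: float):
        self.calls = calls
        self.period = period
        self._stamps = deque()  # start times of recent calls, oldest first

    def wait(self) -> None:
        while True:
            now = time.monotonic()
            # Drop timestamps that have left the window.
            while self._stamps and now - self._stamps[0] >= self.period:
                self._stamps.popleft()
            if len(self._stamps) < self.calls:
                self._stamps.append(now)
                return
            # Sleep until the oldest call leaves the window, then re-check.
            time.sleep(self.period - (now - self._stamps[0]))


@dataclass
class EndpointModel:  # hypothetical parent class
    calls: Optional[int] = None     # max calls per period; None disables limiting
    period: Optional[float] = None  # window length in seconds

    def __post_init__(self):
        # Rate limiting is opt-in: without both arguments, no limiter is created.
        self._limiter = None
        if self.calls is not None and self.period is not None:
            self._limiter = RateLimiter(self.calls, self.period)

    def get_response(self, request):
        if self._limiter is not None:
            self._limiter.wait()
        return self.send(request)

    def send(self, request):
        # Placeholder for the real HTTP call.
        return f"response to {request}"


@dataclass
class O1PreviewEndpoint(EndpointModel):  # hypothetical subclass
    api_version: str = "unknown"

    def __post_init__(self):
        # Chain to the parent so the limiter is always configured.
        super().__post_init__()
```

With this shape, `O1PreviewEndpoint()` behaves exactly as before, while `O1PreviewEndpoint(calls=2, period=60)` would block a third call until the window frees up.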

safooray · 2024-09-26T20:56:03Z

eureka_ml_insights/models/models.py

+        def check_call_limit(*args, **kwargs):
+            return None
+
+        self.check_call_limit = check_call_limit


Why are you assigning this as an instance attribute instead of making check_call_limit a regular member method with self as its first argument?

safooray · 2024-09-26T21:00:57Z

eureka_ml_insights/models/models.py

+
+        @sleep_and_retry
+        @limits(calls=self.calls, period=self.period)
+        def check_call_limit(*args, **kwargs):


Why add this extra method that doesn't do anything instead of directly decorating the method that makes the API call?
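
A sketch of what decorating the API-calling method directly could look like. Decorator arguments such as `calls=self.calls` cannot reference `self` at class-definition time, so this hypothetical `rate_limited` decorator reads the limits from the instance on each call instead. All names here are illustrative, and the decorator is a stdlib stand-in rather than anything provided by `ratelimit`.

```python
import functools
import time


def rate_limited(method):
    """Method decorator: enforce self.calls per self.period seconds, if both are set."""

    @functools.wraps(method)
    def wrapper(self, *args, **kwargs):
        if self.calls and self.period:
            # Per-instance history of call start times, oldest first.
            history = self.__dict__.setdefault("_call_times", [])
            while True:
                now = time.monotonic()
                # Keep only calls that are still inside the window.
                history[:] = [t for t in history if now - t < self.period]
                if len(history) < self.calls:
                    break
                time.sleep(self.period - (now - history[0]))
            history.append(now)
        return method(self, *args, **kwargs)

    return wrapper


class Endpoint:  # hypothetical model class
    def __init__(self, calls=None, period=None):
        self.calls = calls
        self.period = period

    @rate_limited
    def get_response(self, request):
        # Placeholder for the real API call.
        return {"echo": request}
```

This keeps `get_response` an ordinary method and removes the need for a separate no-op `check_call_limit`.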

safooray · 2024-09-26T21:01:45Z

eureka_ml_insights/models/models.py

+            # Print the headers - they include the request ID and the timestamp, which are useful for debugging.
+            logging.info(e.info())
+            logging.info(e.read().decode("utf8", "ignore"))
+        return None, False, False


Please use inheritance to avoid repeating existing code.
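
As a rough illustration of the suggestion, the shared request and error handling can live once in a parent class while the o1-preview subclass overrides only what differs. The class names, the `send` hook, the two-value return, and the payload fields below are all assumptions made for the sketch, not the repository's actual code.

```python
import logging


class RestEndpoint:  # hypothetical parent class
    def create_request(self, prompt):
        return {"messages": [{"role": "user", "content": prompt}], "temperature": 0.0}

    def send(self, payload):
        raise NotImplementedError("the real HTTP call would go here")

    def get_response(self, prompt):
        """Shared request and error handling, written once in the parent."""
        payload = self.create_request(prompt)
        try:
            return self.send(payload), True
        except Exception as e:
            logging.warning("Request failed: %s", e)
            return None, False


class O1PreviewRestEndpoint(RestEndpoint):  # hypothetical subclass
    def create_request(self, prompt):
        # Only the payload differs; get_response and its error handling are inherited.
        payload = super().create_request(prompt)
        payload.pop("temperature", None)         # assumed o1-preview difference
        payload["max_completion_tokens"] = 1024  # assumed o1-preview difference
        return payload
```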

safooray · 2024-09-26T21:02:28Z

eureka_ml_insights/models/models.py

+        if system_message:
+            data["messages"]= [{"role": "system", "content": system_message}] + data["input_data"][
+                "input_string"
+            ]


This should raise a KeyError because data["input_data"] does not exist.
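
To make the failure concrete, here is a small sketch under the assumption that the request payload only contains a `messages` list. The `build_messages` helper is hypothetical; it shows one way to prepend the system message without touching a key that was never set.

```python
def build_messages(text_prompt, system_message=None):
    """Return chat messages, with the system message first when provided."""
    messages = [{"role": "user", "content": text_prompt}]
    if system_message:
        messages = [{"role": "system", "content": system_message}] + messages
    return messages


data = {"messages": build_messages("What is 2 + 2?")}

# A freshly built payload has no "input_data" key, so indexing it fails.
try:
    data["input_data"]["input_string"]
except KeyError as e:
    missing_key = e.args[0]

# Building on the existing messages avoids the missing key entirely.
data["messages"] = build_messages("What is 2 + 2?", system_message="Be brief.")
```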

safooray · 2024-09-26T21:03:31Z

eureka_ml_insights/models/models.py

@@ -9,6 +9,8 @@
 import anthropic
 from azure.identity import AzureCliCredential, get_bearer_token_provider

+from ratelimit import limits, sleep_and_retry


Please make sure to run the linters according to the contribution instructions.

safooray · 2024-09-26T21:04:03Z

setup.py

@@ -39,6 +39,7 @@
        'bitsandbytes>=0.42.0',
        'accelerate>=0.21.0',
        'pycocotools>=2.0.8',
+        'ratelimit>=2.2.1',


Please add this to the conda environment yml file as well.

ilmarinen added 2 commits September 24, 2024 18:47

Add AOAI O1 Preview specific endpoint

52e89b5

Add rate limit logic

476d54d

ilmarinen requested a review from safooray September 25, 2024 15:57

safooray requested changes Sep 27, 2024

ilmarinen added 6 commits September 30, 2024 14:36

Move rate limit logic to ModelEndpoint class

27691b8

Add params to method signature

04e5d51

Rate limit get_response method instead of generate

2de4a16

Make sure that post_inits get chained

85f6547

Checkpoint code for running Eureka in AzureML

5dc2fb7

Add model config

79f09d6

safooray closed this Oct 24, 2024
