Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Nebius AI Studio support: inference platform with dual performance options and competitive pricing #2805

Open
2 tasks done
demianarc opened this issue Nov 4, 2024 · 4 comments
Assignees
Labels
kind:enhancement Indicates a new feature request, imrovement, or extension priority:medium Indicates medium priority

Comments

@demianarc
Copy link

Validations

  • I believe this is a way to improve. I'll try to join the Continue Discord for questions
  • I'm not able to find an open issue that requests the same enhancement

Problem

Developers using Continue face several challenges that impact their coding flow:

Cost barriers when scaling AI usage across teams

High token costs limiting experimentation and usage
Difficulty predicting and managing AI development expenses
Budget constraints affecting adoption across organizations

Performance vs. cost trade-offs

Need to choose between fast responses and budget optimization
Inconsistent performance affecting development flow
Limited options for different coding scenarios (quick prototyping vs. production code)

Model accessibility and flexibility

Limited options for specialized coding models
Difficulty accessing cutting-edge open-source models
Need for different models for various development tasks

Solution

Integrating Nebius AI Studio with Continue would enhance the development experience by providing:

For individual developers:

Uninterrupted coding flow with dual model options:
"Fast" flavor for real-time pair programming
"Base" flavor for cost-effective batch operations

  • Seamless access to specialized coding models like DeepSeek Coder, Meta-Llama-3.1-Nemotron-70B, Qwen/Qwen2.5-72B-Instruct and much more
  • $100 in free credits to experiment and find optimal settings

For Teams and organizations:

  • Affordable and flexible pricing model with separate input/output token rates
  • Centralized access to multiple state-of-the-art models
  • Batch processing capabilities for efficient large-scale operations

Technical Integration benefits:

  • Drop-in OpenAI API compatibility
  • Rich selection of coding-optimized models
  • Structured output support for precise code generation
  • Simple API key authentication

Implementation approach:

  1. Add Nebius AI Studio as a model provider alongside existing options
  2. Implement model flavor selection in Continue's configuration
  3. Maybe integrate batch processing for improved performance

This integration aligns perfectly with Continue's mission to amplify developers and enhance development through AI, while adding valuable flexibility in model choice and cost optimization.
We're ready to contribute to the implementation and collaborate with the Continue team to ensure a seamless integration that enhances the developer experience.

@sestinj sestinj self-assigned this Nov 4, 2024
@dosubot dosubot bot added kind:enhancement Indicates a new feature request, imrovement, or extension priority:medium Indicates medium priority labels Nov 4, 2024
@sestinj
Copy link
Contributor

sestinj commented Nov 5, 2024

@demianarc Thanks for the great write-up here! I was already excited about the PR the other day, but sounds like the next outcome would be to add the flavor option in config.json. I think probably this would fit well in the ModelDescription type in index.d.ts, which can then be read from config.json and passed into the LLM class here

@demianarc
Copy link
Author

demianarc commented Nov 6, 2024 via email

@RomneyDa
Copy link
Collaborator

RomneyDa commented Nov 8, 2024

@demianarc also would love a contribution on the docs page! Original PR has no docs

For example,
https://github.com/continuedev/continue/blob/dev/docs/docs/customize/model-providers/more/nvidia.md

Any Nebius specific notes, direction to the nebius docs, etc. would be great

@vadjs
Copy link
Contributor

vadjs commented Nov 11, 2024

Hi @RomneyDa,
I'm author of the original PR. Sorry for the missing documentation. I added it in the following PR as well as other improvements.
#2875

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind:enhancement Indicates a new feature request, imrovement, or extension priority:medium Indicates medium priority
Projects
None yet
Development

No branches or pull requests

4 participants