-
Notifications
You must be signed in to change notification settings - Fork 175
Issues: awslabs/data-on-eks
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Ray Logging config for Ray Head pod and export logs to S3
enhancement
New feature or request
#552
opened Jun 5, 2024 by
vara-bonthu
Ray Observability with Prometheus and AMP
enhancement
New feature or request
#551
opened Jun 5, 2024 by
vara-bonthu
vLLM with RayServe pattern
gen-ai pattern
Distributed Training and Inference Patterns for Various Generative AI Large Language Models (LLMs)
#547
opened Jun 3, 2024 by
shivam-dubey-1
Llama-3 on Inferentia generate infinite and meaningless output
#544
opened May 29, 2024 by
yubingjiaocn
1 task done
Incorrect POD name "aws-cli-cmd-shell" given in the instructions.
#543
opened May 29, 2024 by
AbrahamArellano
1 task done
How to run Data EKS Gen AI models with limited EC2 vCPUs service quota?
#539
opened May 25, 2024 by
Gall-oDrone
JARK Stack - Error while launching training step in the dogbooth Jupyter notebook
#537
opened May 20, 2024 by
rivasdam
1 task done
Incorrect command to provide Linux permission on the AWS Trainium on EKS Blueprint
bug
Something isn't working
documentation
Improvements or additions to documentation
#533
opened May 17, 2024 by
AbrahamArellano
1 task done
[Website] Add Scalability Best Practices & Considerations for DoEKS Workloads
documentation
Improvements or additions to documentation
#532
opened May 15, 2024 by
brianhammons
Re-introduce plan-examples.yml with a proper fix
bug
Something isn't working
#525
opened May 10, 2024 by
askulkarni2
Chore: Kubernetes cluster version upgrades
good first issue
Good for newcomers
stale
#520
opened May 8, 2024 by
raykrueger
[Inference]: NVIDIA Triton Server with vLLM gen-ai pattern
gen-ai pattern
Distributed Training and Inference Patterns for Various Generative AI Large Language Models (LLMs)
stale
#518
opened May 7, 2024 by
vara-bonthu
Update documentation for JupyterHub on EKS solution
bug
Something isn't working
documentation
Improvements or additions to documentation
#515
opened May 2, 2024 by
petrokashlikov
1 task done
[Inference]: RayServe with NVIDIA Triton server pattern
gen-ai pattern
Distributed Training and Inference Patterns for Various Generative AI Large Language Models (LLMs)
#509
opened Apr 25, 2024 by
vara-bonthu
[Inference]: NVIDIA Triton Server with TensortRT-LLM pattern
gen-ai pattern
Distributed Training and Inference Patterns for Various Generative AI Large Language Models (LLMs)
stale
#508
opened Apr 25, 2024 by
vara-bonthu
[Inference]: Mistral7b on GPUs with JARK stack with Ray Serve
enhancement
New feature or request
gen-ai pattern
Distributed Training and Inference Patterns for Various Generative AI Large Language Models (LLMs)
#497
opened Apr 8, 2024 by
vara-bonthu
[Website]: JARK Website doc for Stable Diffusion Inference on GPUs
documentation
Improvements or additions to documentation
enhancement
New feature or request
gen-ai pattern
Distributed Training and Inference Patterns for Various Generative AI Large Language Models (LLMs)
#496
opened Apr 8, 2024 by
vara-bonthu
deploy gradio app for llama2 on inf2/ray to k8s
enhancement
New feature or request
#495
opened Apr 8, 2024 by
harishvs
The inf2/ray gradio app does not format new lines in the output
enhancement
New feature or request
#494
opened Apr 8, 2024 by
harishvs
1 task done
Add temprature, topk, topk and other input params to UI for llama2 gradio application on inf2/ray cluster
enhancement
New feature or request
#493
opened Apr 8, 2024 by
harishvs
Previous Next
ProTip!
Updated in the last three days: updated:>2024-06-07.