Init adr 001
This commit is contained in:
parent
cfb6e03d98
commit
7257fe95ee
47
adr/adr-001-jan-deployable-cloud-native.md
Normal file
47
adr/adr-001-jan-deployable-cloud-native.md
Normal file
@ -0,0 +1,47 @@
|
|||||||
|
# ADR #011: Jan deployable cloud-native
|
||||||
|
|
||||||
|
## Changelog
|
||||||
|
|
||||||
|
- 23.10.03: Initial unfinished draft
|
||||||
|
|
||||||
|
## Authors
|
||||||
|
|
||||||
|
- @nam-john-ho
|
||||||
|
|
||||||
|
## Context
|
||||||
|
|
||||||
|
### Status Quo
|
||||||
|
|
||||||
|
User doesn't have a local GPU machine but wants to run Jan on a rented server
|
||||||
|
User wants a quick, fast way to experiment with Jan on a rented GPU
|
||||||
|
https://github.com/janhq/jan/issues/255
|
||||||
|
|
||||||
|
## Decision
|
||||||
|
|
||||||
|
This ADR aims to outline design decisions for deploying Jan in cloud native environments such as: Runpod, AWS, Azure, GCP in a fast and simple way.
|
||||||
|
The current code-base should not change too much.
|
||||||
|
The current plugins should be reusable across enviroments (Desktop, Cloud-native).
|
||||||
|
Simple authentication (username/password) should be supported.
|
||||||
|
|
||||||
|
|
||||||
|
### Key Design Decisions
|
||||||
|

|
||||||
|
|
||||||
|
|
||||||
|
### Detailed Design
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
|
## Alternative Approaches
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
|
## Considerations
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
|
https://www.runpod.io/console/templates
|
||||||
|
https://repost.aws/articles/ARQ0Tz9eorSL6EAus7XPMG-Q/how-to-install-textgen-webui-on-aws
|
||||||
|
https://www.youtube.com/watch?v=_59AsSyMERQ
|
||||||
|
https://gpus.llm-utils.org/running-llama-2-on-runpod-with-oobaboogas-text-generation-webui/
|
||||||
|
https://medium.com/@jarimh1984/installing-oobabooga-and-oobabooga-api-to-runpod-cloud-step-by-step-tutorial-47457974dfa5
|
||||||
BIN
adr/images/adr-001-01.png
Normal file
BIN
adr/images/adr-001-01.png
Normal file
Binary file not shown.
|
After Width: | Height: | Size: 172 KiB |
Loading…
x
Reference in New Issue
Block a user