Docker fixes and other improvements #19

Open
PolynomialDivision wants to merge 13 commits into LibreTranslate:main from PolynomialDivision:docker-fixes-and-other-improvements

Conversation

@PolynomialDivision

Contributor

Some small Docker fixes, plus handling for a busy GPU.

@PolynomialDivision PolynomialDivision force-pushed the docker-fixes-and-other-improvements branch from 32ba027 to 2a12950 Compare June 16, 2026 12:51
Concurrent requests serialize through a mutex. Previously the second
request would block forever and silently return the original input on
failure. Now it waits up to 120 seconds and returns HTTP 503 if the
server is still busy.
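
The behaviour described in this commit message can be sketched roughly as follows. This is a minimal, standard-library-only illustration and not the code from this PR: the names `lock_with_timeout`, `translate`, and `Busy` are hypothetical, the polling loop is just one way to bound the wait, and the uppercase call stands in for the real model invocation. The caller would pass `Duration::from_secs(120)` to match the timeout mentioned above.

```rust
use std::sync::{Mutex, MutexGuard, TryLockError};
use std::thread;
use std::time::{Duration, Instant};

/// Hypothetical error type; a web handler would map this to HTTP 503.
#[derive(Debug, PartialEq)]
pub enum Busy {
    ServiceUnavailable,
}

/// Poll `try_lock` until the lock is free or `timeout` has elapsed.
pub fn lock_with_timeout<'a, T>(
    gpu: &'a Mutex<T>,
    timeout: Duration,
) -> Result<MutexGuard<'a, T>, Busy> {
    let deadline = Instant::now() + timeout;
    loop {
        match gpu.try_lock() {
            Ok(guard) => return Ok(guard),
            // A previous holder panicked; the lock itself is still usable.
            Err(TryLockError::Poisoned(poisoned)) => return Ok(poisoned.into_inner()),
            Err(TryLockError::WouldBlock) => {
                if Instant::now() >= deadline {
                    // Report "busy" instead of blocking forever.
                    return Err(Busy::ServiceUnavailable);
                }
                thread::sleep(Duration::from_millis(10));
            }
        }
    }
}

/// Hypothetical request handler: fails loudly instead of echoing the input.
pub fn translate(gpu: &Mutex<()>, text: &str, timeout: Duration) -> Result<String, Busy> {
    let _guard = lock_with_timeout(gpu, timeout)?;
    Ok(text.to_uppercase()) // placeholder for the actual model call
}
```

Returning an explicit error keeps a timed-out request distinguishable from a successful translation, which the old "return the original input" behaviour did not.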
@PolynomialDivision PolynomialDivision force-pushed the docker-fixes-and-other-improvements branch from 2a12950 to 2f4a01b Compare June 16, 2026 16:45
@PolynomialDivision

Contributor Author

Run-tested on my Unraid server, except for the new Gemma 4 model.

@PolynomialDivision PolynomialDivision force-pushed the docker-fixes-and-other-improvements branch from 2f4a01b to e5d8e78 Compare June 16, 2026 17:05
@PolynomialDivision PolynomialDivision marked this pull request as draft June 16, 2026 18:26
…cpp-2

Drops the git submodule in favor of llama-cpp-2 v0.1.146 from crates.io
(utilityai/llama-cpp-rs). The fork was needed when upstream lacked Gemma 3
chat template support. Use token_to_piece instead of the deprecated
token_to_bytes / Special API.
@PolynomialDivision PolynomialDivision force-pushed the docker-fixes-and-other-improvements branch from e5d8e78 to 03859c6 Compare June 16, 2026 21:47
@PolynomialDivision PolynomialDivision marked this pull request as ready for review June 16, 2026 21:52
@PolynomialDivision

Contributor Author

Gemma 4 works now.

@PolynomialDivision PolynomialDivision force-pushed the docker-fixes-and-other-improvements branch 2 times, most recently from d58d1ff to adf2500 Compare June 17, 2026 08:48
Gemma 4's template hides <start_of_turn> in macros so detection fails.
Fall back to hardcoded Gemma format when apply_chat_template returns an error.
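
A rough sketch of the fallback described in this commit message, under stated assumptions: `Message`, `build_prompt`, and `gemma_fallback_prompt` are hypothetical names, the template step is passed in as a closure rather than calling the real llama-cpp-2 API, and the hardcoded layout is the `<start_of_turn>` / `<end_of_turn>` turn format documented for earlier Gemma releases. The PR's actual code may differ.

```rust
/// Hypothetical chat message; not a type from llama-cpp-2.
pub struct Message<'a> {
    pub role: &'a str,
    pub content: &'a str,
}

/// Build a prompt in the Gemma turn format by hand.
pub fn gemma_fallback_prompt(messages: &[Message<'_>]) -> String {
    let mut out = String::new();
    for m in messages {
        // Gemma names the assistant role "model".
        let role = if m.role == "assistant" { "model" } else { m.role };
        out.push_str("<start_of_turn>");
        out.push_str(role);
        out.push('\n');
        out.push_str(m.content);
        out.push_str("<end_of_turn>\n");
    }
    // Leave the final turn open so the model generates the reply.
    out.push_str("<start_of_turn>model\n");
    out
}

/// Try the model's own chat template first; use the hardcoded
/// format only when applying the template returns an error.
pub fn build_prompt<F>(messages: &[Message<'_>], apply_template: F) -> String
where
    F: FnOnce(&[Message<'_>]) -> Result<String, String>,
{
    match apply_template(messages) {
        Ok(prompt) => prompt,
        Err(_) => gemma_fallback_prompt(messages),
    }
}
```

Keying the fallback on the error result, rather than on searching the template text for `<start_of_turn>`, sidesteps the detection problem mentioned above.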
@PolynomialDivision PolynomialDivision force-pushed the docker-fixes-and-other-improvements branch from adf2500 to 3de18cf Compare June 17, 2026 09:24
@PolynomialDivision

Contributor Author

I tried to group the changes in this PR into several smaller PRs.
