Embedder by ArnabChatterjee20k · Pull Request #1 · appwrite/embedding

ArnabChatterjee20k · 2026-05-22T12:17:29Z

What does this PR do?

(Provide a description of what this PR does.)

Test Plan

(Write your test plan here. If you changed any code, please provide us with clear instructions on how you verified your changes work.)

Related PRs and Issues

(If this PR is related to any other PR or resolves any issue or related to any issue link all related PR and issues here.)

Have you read the Contributing Guidelines on issues?

(Write your answer here.)

- Introduced `EmbeddingConfig` struct for managing embedding model configurations.
- Implemented `EmbeddingClient` for handling model initialization and embedding generation.
- Added environment variable support for model selection, cache directory, and pool size.
- Created a main function to demonstrate model loading and embedding generation with sample texts.
- Integrated `fastembed` library for embedding functionalities.


ArnabChatterjee20k · 2026-05-22T12:28:16Z

@copilot create a pr fixing the dockerfile

Copilot · 2026-05-22T12:28:26Z

@ArnabChatterjee20k I've opened a new pull request, #2, to work on those changes. Once the pull request is ready, I'll request review from you.


greptile-apps · 2026-05-26T12:39:22Z

Greptile Summary

This PR introduces a new Rust microservice that serves text embeddings over HTTP using the fastembed library with ONNX-backed models. It includes memory-aware pool sizing, round-robin instance dispatch, dynamic sub-batch computation, a Docker image, and a CI workflow.
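To make the memory-aware pool sizing and sub-batch splitting mentioned above more concrete, here is a minimal sketch. The helper names `pool_size` and `sub_batches`, their parameters, and the ceiling-division split are assumptions for illustration only; the PR's actual logic in src/embedding.rs may differ.

```rust
/// Hypothetical helper: cap the requested pool size by how many model
/// instances fit in the memory budget, always keeping at least one.
pub fn pool_size(requested: usize, budget_bytes: u64, per_instance_bytes: u64) -> usize {
    if per_instance_bytes == 0 {
        // No measurement available; fall back to the requested size.
        return requested.max(1);
    }
    let fits = (budget_bytes / per_instance_bytes) as usize;
    requested.min(fits).max(1)
}

/// Hypothetical helper: split `texts` into at most `workers` contiguous
/// sub-batches of near-equal size (ceiling division).
pub fn sub_batches<'a>(texts: &'a [&'a str], workers: usize) -> Vec<&'a [&'a str]> {
    if texts.is_empty() {
        return Vec::new();
    }
    let workers = workers.max(1);
    let chunk = (texts.len() + workers - 1) / workers;
    texts.chunks(chunk).collect()
}
```

Each sub-batch could then be handed to its own blocking worker, so a single large request is spread across the pool instead of serialising on one instance.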

- src/embedding.rs: Implements EmbeddingClient with a per-model Vec<Arc<Mutex<TextEmbedding>>> pool, warmup-based memory measurement for pool sizing, and async batched inference split across tokio::task::spawn_blocking workers.
- src/main.rs: Thin Axum server exposing a single POST /embed route; validates that texts is non-empty but has no upper-bound guard.
- tests/embed_e2e.rs: Integration tests. The non-ignored embed_unknown_alias_returns_error test silently passes in CI without exercising its assertion because it uses .ok() to swallow a model-loading failure that occurs before the alias check.
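The round-robin dispatch over a `Vec<Arc<Mutex<_>>>` pool described above might look roughly like the following. This is a sketch under assumptions: `Pool`, `cursor`, and `acquire` are illustrative names, and the generic `T` stands in for a loaded model instance. Only the `next_index` name comes from the PR's commit titles.

```rust
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::{Arc, Mutex};

/// Stand-in pool; `T` plays the role of a loaded model instance.
pub struct Pool<T> {
    instances: Vec<Arc<Mutex<T>>>,
    cursor: AtomicUsize,
}

impl<T> Pool<T> {
    /// Builds a pool from already-constructed instances.
    /// Panics on an empty input, since there would be nothing to dispatch to.
    pub fn new(items: Vec<T>) -> Self {
        assert!(!items.is_empty(), "pool needs at least one instance");
        Self {
            instances: items.into_iter().map(|i| Arc::new(Mutex::new(i))).collect(),
            cursor: AtomicUsize::new(0),
        }
    }

    /// Returns the next slot index, wrapping around the pool length.
    pub fn next_index(&self) -> usize {
        self.cursor.fetch_add(1, Ordering::Relaxed) % self.instances.len()
    }

    /// Hands out a shared handle to the next instance in rotation.
    pub fn acquire(&self) -> Arc<Mutex<T>> {
        Arc::clone(&self.instances[self.next_index()])
    }
}
```

The atomic counter avoids taking a lock just to pick a slot; callers only contend on the mutex of the instance they were assigned.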

Confidence Score: 4/5

Safe to merge with one test correctness fix — the alias-rejection test never executes its assertion in CI and should be restructured or marked #[ignore].

The alias-rejection test in tests/embed_e2e.rs swallows the model-loading error with .ok() and wraps its assertions in if let Some(client), so on a CI runner without a cached model the entire assertion block is skipped and the test passes vacuously. This means the alias-validation code path is not exercised in CI at all, despite appearing in the test suite.

tests/embed_e2e.rs — the embed_unknown_alias_returns_error test needs restructuring to actually run its assertion. src/model.rs — the dimension() function should add an explicit arm for EmbeddingGemma300M.
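As an illustration of the explicit-arm suggestion for `dimension()`, here is a minimal sketch. The `Model` enum below is a hypothetical stand-in (the real code presumably matches on the model type used with fastembed), and `Other` represents whatever the existing catch-all covers. The 768 value for EmbeddingGemma300M is the one the catch-all already yields per this review.

```rust
/// Hypothetical stand-in for the model enum used in src/model.rs.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Model {
    AllMiniLML6V2,
    EmbeddingGemma300M,
    /// Placeholder for models currently handled by the catch-all arm.
    Other,
}

/// Output vector length per model, with no wildcard arm.
pub fn dimension(model: Model) -> usize {
    match model {
        Model::AllMiniLML6V2 => 384,
        // Explicit arm: same value as the fallback today, but now a
        // deliberate choice rather than an accident of fall-through.
        Model::EmbeddingGemma300M => 768,
        Model::Other => 768,
    }
}
```

With every variant listed, adding a new model to the enum becomes a compile error until its dimension is stated, instead of silently inheriting 768.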

Important Files Changed

| Filename | Overview |
|---|---|
| src/embedding.rs | Core embedding client: pool management, memory-aware sizing, and async batched inference. Previously flagged issues remain; no new logic bugs in this file. |
| src/main.rs | Axum HTTP server with single /embed route; no upper bound on texts array size and no graceful-shutdown signal handler (previously flagged). |
| src/model.rs | Model alias resolution and dimension lookup; EmbeddingGemma300M falls through to the 768 catch-all arm in dimension(). |
| tests/embed_e2e.rs | E2e integration tests; embed_unknown_alias_returns_error is not #[ignore]d but requires a real model download, so it vacuously passes in CI without exercising the assertion. |
| src/lib.rs | Library crate root; re-exports the public surface cleanly, no issues. |
| Dockerfile | Multi-stage build with non-root user and BuildKit cache mounts; looks correct. |
| .github/workflows/ci.yml | CI workflow targets main branch on push, not master (previously flagged); otherwise fmt/clippy/test steps are well-structured. |
| Cargo.toml | Dependencies pinned with explicit versions; tokio signal feature is included in preparation for graceful shutdown. |
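For the missing upper bound on the texts array noted for src/main.rs, a guard could be as small as the sketch below. `MAX_TEXTS`, its value of 256, `validate_texts`, and the error strings are all assumptions chosen for illustration; the right limit depends on model size and available memory.

```rust
/// Hypothetical request limit; tune to the deployment's memory budget.
pub const MAX_TEXTS: usize = 256;

/// Rejects empty requests (the check the handler already performs) and
/// oversized requests (the check this review says is missing).
pub fn validate_texts(texts: &[String]) -> Result<(), String> {
    if texts.is_empty() {
        return Err("texts must not be empty".to_string());
    }
    if texts.len() > MAX_TEXTS {
        return Err(format!(
            "texts has {} items; at most {} are allowed",
            texts.len(),
            MAX_TEXTS
        ));
    }
    Ok(())
}
```

In the handler, an `Err` from this function would typically map to a 4xx response before any inference work is scheduled.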

Reviews (2): Last reviewed commit: "feat: implement multi-model support in E..."

greptile-apps · 2026-05-26T12:39:27Z

+on:
+  push:
+    branches: [main]
+  pull_request:


CI push trigger targets main, but the repository's default branch (and the base for this PR) is master. As-is, the test and docker jobs will never fire on a direct push to master, so merges to the primary branch go unvalidated by CI.

Suggested change

-on:
-  push:
-    branches: [main]
-  pull_request:
+on:
+  push:
+    branches: [master]
+  pull_request:

greptile-apps · 2026-05-26T12:39:36Z

+#[tokio::main]
+async fn main() -> Result<(), Box<dyn std::error::Error + Send + Sync>> {
+    tracing_subscriber::fmt()
+        .with_env_filter(
+            tracing_subscriber::EnvFilter::try_from_default_env()
+                .unwrap_or_else(|_| tracing_subscriber::EnvFilter::new("info")),
+        )
+        .init();
+
+    let config = EmbeddingConfig::from_env();
+    let client = Arc::new(EmbeddingClient::new(config)?);
+    let state = AppState { client };
+
+    let app = Router::new().route("/embed", post(embed)).with_state(state);
+
+    let addr = std::env::var("BIND_ADDR").unwrap_or_else(|_| "0.0.0.0:3000".to_string());
+    let listener = tokio::net::TcpListener::bind(&addr).await?;
+    tracing::info!("listening on {}", addr);
+    axum::serve(listener, app).await?;
+
+    Ok(())
+}


No graceful shutdown handling

axum::serve is awaited directly with no shutdown signal. When the container receives SIGTERM, the process exits immediately and any in-flight embedding requests are dropped. The value returned by axum::serve has a .with_graceful_shutdown(signal) method that stops accepting new connections and lets in-flight requests finish before exit.
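The drain-before-exit behaviour being asked for can be illustrated with the standard library alone. The sketch below is not axum's API: it models the idea with a channel, where dropping the sender plays the role of the shutdown signal and the worker still finishes everything already queued. `run_until_drained` is a hypothetical name.

```rust
use std::sync::mpsc;
use std::thread;

/// Queues `jobs`, signals shutdown by dropping the sender, and returns
/// how many jobs the worker completed before exiting.
pub fn run_until_drained(jobs: Vec<u64>) -> usize {
    let (tx, rx) = mpsc::channel::<u64>();

    let worker = thread::spawn(move || {
        let mut completed = 0usize;
        // `recv` keeps yielding messages that were sent before the sender
        // was dropped, and only returns Err once the queue is empty.
        while let Ok(_job) = rx.recv() {
            completed += 1;
        }
        completed
    });

    for job in jobs {
        tx.send(job).expect("worker is still receiving");
    }

    // "Shutdown signal": no new work can be submitted after this point.
    drop(tx);

    // Wait for in-flight work to finish instead of exiting immediately.
    worker.join().expect("worker thread panicked")
}
```

In the server itself the analogous step would be passing a future that resolves on SIGTERM to `.with_graceful_shutdown(...)`, so that the serve future only completes after open requests are done.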


greptile-apps · 2026-05-26T15:48:53Z

+#[tokio::test(flavor = "multi_thread")]
+async fn embed_unknown_alias_returns_error() {
+    // No model loading on this path — fails the alias check first.
+    let client = EmbeddingClient::new(small_model_config(1)).ok();
+    if let Some(client) = client {
+        let err = client.embed("not-a-real-model", &["x"]).await.unwrap_err();
+        assert!(err.contains("unknown model alias"), "got: {}", err);
+    }
+}


Test silently passes in CI without exercising the assertion

EmbeddingClient::new(small_model_config(1)) calls Self::load_model which downloads the AllMiniLML6V2 ONNX model — this is the same model loading path as the #[ignore]d tests. In a fresh CI environment (no cached model), construction fails and .ok() produces None, so the if let Some(client) block is never entered and the actual alias-rejection assertion never runs. The test passes vacuously every CI run, giving false assurance. The comment "No model loading on this path" is incorrect — the alias check lives in embed(), which is never reached if new() fails.

The cleanest fix is to test alias rejection through model::from_name directly (no client required), or add #[ignore] like the other e2e tests that need a real model.
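A sketch of the first option, testing alias rejection without constructing a client. The `from_name` signature, the `Model` enum, and the `"all-minilm-l6-v2"` alias below are assumptions standing in for whatever src/model.rs actually exposes; only the "unknown model alias" message is taken from the test under review.

```rust
/// Hypothetical stand-in for the model type resolved from an alias.
#[derive(Debug, PartialEq, Eq)]
pub enum Model {
    AllMiniLML6V2,
}

/// Hypothetical stand-in for `model::from_name`: pure string matching,
/// so it needs no model download and no network access.
pub fn from_name(alias: &str) -> Result<Model, String> {
    match alias {
        "all-minilm-l6-v2" => Ok(Model::AllMiniLML6V2),
        other => Err(format!("unknown model alias: {other}")),
    }
}

#[cfg(test)]
mod tests {
    use super::*;

    #[test]
    fn unknown_alias_is_rejected() {
        // No `.ok()` and no `if let`: the assertion always runs.
        let err = from_name("not-a-real-model").unwrap_err();
        assert!(err.contains("unknown model alias"), "got: {err}");
    }
}
```

Because nothing here can fail for environmental reasons, the test either exercises the alias check or fails loudly, which removes the vacuous-pass path.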

ArnabChatterjee20k added 10 commits May 22, 2026 14:34

- feat: update embedding configuration and enhance embedding client functionality (4599430)
- refactor: rename EmbedOutcome to EmbeddingResult for consistency in embedding client (396ece4)
- feat: add embedding endpoint with request handling and response serialization (dea5e94)
- feat: implement next_index function for round-robin model acquisition and add integration tests (207be4e)
- feat: add initial Docker setup with Dockerfile, docker-compose, CI workflow, and README documentation (d196e8d)
- Merge remote-tracking branch 'origin/master' into embedder (cbd0d1c)
- updaetd dockerfile (70bf5dc)
- fix: improve error handling and memory pool size calculation in EmbeddingClient (141afde)
- fix: add missing g++ installation in Dockerfile builder stage (161d433)
- Initial plan (29dcd99)

Copilot AI mentioned this pull request May 22, 2026

Set default runtime model cache path and fix Docker CI compatibility #2

Merged

Copilot AI and others added 6 commits May 22, 2026 12:33

- fix: configure default cache path in Dockerfile runtime image (b6ccb7c)
  Agent-Logs-Url: https://github.com/appwrite/embedding/sessions/4f5decfe-9143-42bd-b0e6-811ed2616fcb Co-authored-by: ArnabChatterjee20k <83803257+ArnabChatterjee20k@users.noreply.github.com>
- fix: scope models directory ownership change in Dockerfile (70bd4b9)
  Agent-Logs-Url: https://github.com/appwrite/embedding/sessions/4f5decfe-9143-42bd-b0e6-811ed2616fcb Co-authored-by: ArnabChatterjee20k <83803257+ArnabChatterjee20k@users.noreply.github.com>
- fix: align Docker images to trixie for ORT/glibc compatibility (1ee5f4c)
  Agent-Logs-Url: https://github.com/appwrite/embedding/sessions/5bc8a2a9-7dfa-400d-9834-850b8f96eb8d Co-authored-by: ArnabChatterjee20k <83803257+ArnabChatterjee20k@users.noreply.github.com>
- Merge pull request #2 from appwrite/copilot/sub-pr-1 (d961e2c)
  Set default runtime model cache path and fix Docker CI compatibility
- triggering greptile (5ec5abd)
- Merge remote-tracking branch 'origin/embedder' into embedder (b051f46)

greptile-apps Bot reviewed May 26, 2026


ArnabChatterjee20k added 2 commits May 26, 2026 18:10

- updated the env (2e1a3d3)
- feat: implement multi-model support in EmbeddingClient and update configuration (9882616)

greptile-apps Bot reviewed May 26, 2026


ArnabChatterjee20k wants to merge 19 commits into master from embedder


2 participants
