Skip to content

Dockerizing#1

Draft
stepdi wants to merge 92 commits into
mainfrom
docker
Draft

Dockerizing#1
stepdi wants to merge 92 commits into
mainfrom
docker

Conversation

@stepdi
Copy link
Copy Markdown
Collaborator

@stepdi stepdi commented May 14, 2025

No description provided.

stepdi and others added 30 commits May 13, 2025 22:22
…cking

feat: add cost tracking (clean PR)
…e-mode

Add offline mode support (cherry-pick)
Robert Leonard and others added 25 commits May 31, 2025 23:35
…-stepname-model-selection

Fix: Ensure summarization uses correct model by aligning step name
…-markitdown

Improve Ingestion for Web Documents
…core-filtering

Fix citation score filtering
…n permissions (huggingface#101)

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>
…g the full document text into each row (useful when your docs are large)
Changed model to gpt-4o (from gpt-4.1).
Comment thread README.docker.md

The container requires the following environment variables:

- `INPUT_S3_BUCKET`: S3 bucket name for input data
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i think we are missing some of the variables here and below in the example docker run command

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants