SqueezeBits Inc.
- 49 followers
- Korea, South
- https://squeezebits.com/
- info@squeezebits.com
Popular repositories Loading
-
Torch-TRTLLM
Torch-TRTLLM PublicDitto is an open-source framework that enables direct conversion of HuggingFace PreTrainedModels into TensorRT-LLM engines.
-
owlite-examples
owlite-examples PublicOwLite Examples repository offers illustrative example codes to help users seamlessly compress PyTorch deep learning models and transform them into TensorRT engines.
-
Repositories
- llm-compressor-K-EXAONE Public Forked from vllm-project/llm-compressor
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
SqueezeBits/llm-compressor-K-EXAONE’s past year of commit activity - vla-evaluation-harness Public Forked from allenai/vla-evaluation-harness
One framework to evaluate any VLA model on any robot simulation benchmark.
SqueezeBits/vla-evaluation-harness’s past year of commit activity - JacobiForcing-K-LLM Public Forked from hao-ai-lab/JacobiForcing
Jacobi Forcing: Fast and Accurate Diffusion-style Decoding
SqueezeBits/JacobiForcing-K-LLM’s past year of commit activity - optimum-rbln Public Forked from RBLN-SW/optimum-rbln
⚡ A seamless integration of HuggingFace Transformers & Diffusers with RBLN SDK for efficient inference on RBLN NPUs.
SqueezeBits/optimum-rbln’s past year of commit activity - diffusers_benchmark Public
SqueezeBits/diffusers_benchmark’s past year of commit activity - vllm Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
SqueezeBits/vllm’s past year of commit activity - guidellm-rbln-vllm Public
SqueezeBits/guidellm-rbln-vllm’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…