Releases · marella/ctransformers
0.2.27
Changes
- Skip evaluating tokens that have already been evaluated. This can significantly speed up prompt processing in chat applications that prepend previous messages to the prompt (see the sketch after this list).
- Deprecate the `LLM.reset()` method. Use the high-level API instead.
- Add support for batching and beam search to the 🤗 Transformers integration.
- Remove the universal binary option when building for AVX2 and AVX on macOS.
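A minimal sketch of the chat pattern the token-skipping change targets; the model repo is illustrative and the conversation formatting is simplified:

```python
from ctransformers import AutoModelForCausalLM

llm = AutoModelForCausalLM.from_pretrained("marella/gpt-2-ggml")  # illustrative model

history = ""
for user_message in ["Hello!", "Tell me more."]:
    # Each turn prepends the full conversation so far to the prompt.
    prompt = history + "User: " + user_message + "\nAssistant:"
    reply = llm(prompt, max_new_tokens=64)
    # Because the new prompt starts with already-evaluated text, 0.2.27
    # skips those tokens instead of reprocessing the shared prefix.
    history = prompt + reply + "\n"
```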
0.2.26
Changes
- Add support for 🤗 Transformers
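The README-style usage for the 🤗 Transformers integration looks like this (the model name is the one used in the project docs):

```python
from ctransformers import AutoModelForCausalLM, AutoTokenizer
from transformers import pipeline

# Load a GGML model as a 🤗 Transformers-compatible model and tokenizer.
model = AutoModelForCausalLM.from_pretrained("marella/gpt-2-ggml", hf=True)
tokenizer = AutoTokenizer.from_pretrained(model)

pipe = pipeline("text-generation", model=model, tokenizer=tokenizer)
print(pipe("AI is going to", max_new_tokens=64)[0]["generated_text"])
```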
0.2.25
Changes
- Add support for GGUF v2
- Add CUDA support for Falcon GGUF models
- Add ROCm support
- Add low-level API for `add_bos_token`, `bos_token_id`
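A sketch of the new low-level calls, assuming `tokenize` accepts the `add_bos_token` flag and `bos_token_id` is exposed as a property; repo and file names are placeholders:

```python
from ctransformers import AutoModelForCausalLM

# Placeholder LLaMA-style GGUF model.
llm = AutoModelForCausalLM.from_pretrained(
    "TheBloke/Llama-2-7B-GGUF", model_file="llama-2-7b.Q4_K_M.gguf"
)

# Tokenize with an explicit BOS token and verify it leads the sequence.
tokens = llm.tokenize("Hello", add_bos_token=True)
assert tokens[0] == llm.bos_token_id
```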
0.2.24
Changes
- Add GGUF format support for Llama and Falcon models
- Add support for Code Llama models
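Loading a Code Llama model in the new GGUF format might look like this; the repo, file, and prompt are illustrative:

```python
from ctransformers import AutoModelForCausalLM

# Illustrative Code Llama GGUF checkpoint.
llm = AutoModelForCausalLM.from_pretrained(
    "TheBloke/CodeLlama-7B-GGUF",
    model_file="codellama-7b.Q4_K_M.gguf",
    model_type="llama",  # Code Llama uses the llama model type
)
print(llm("def fibonacci(n):", max_new_tokens=64))
```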
0.2.23
Changes
- Add `mmap` and `mlock` parameters for LLaMA and Falcon models (see the sketch below)
- Add `revision` option for models on Hugging Face Hub
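A combined sketch of both options, assuming they are passed as keyword arguments to `from_pretrained`; the repo name is illustrative:

```python
from ctransformers import AutoModelForCausalLM

llm = AutoModelForCausalLM.from_pretrained(
    "TheBloke/Llama-2-7B-GGML",  # illustrative repo
    revision="main",  # pin a branch, tag, or commit on the Hub
    mmap=True,        # memory-map the model file instead of loading it eagerly
    mlock=False,      # set True to lock pages in RAM and avoid swapping
)
```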
0.2.22
Changes
- Add experimental CUDA support for StarCoder, StarChat models
- Add `gpt_bigcode` as model type for StarCoder, StarChat models (example below)
- Fix loading GPTQ models from a local path
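Putting the two StarCoder changes together; the repo name and layer count are illustrative:

```python
from ctransformers import AutoModelForCausalLM

llm = AutoModelForCausalLM.from_pretrained(
    "TheBloke/starcoder-GGML",  # illustrative repo
    model_type="gpt_bigcode",   # new model type for StarCoder/StarChat
    gpu_layers=50,              # experimental CUDA offload: layers to run on GPU
)
```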
0.2.21
Changes
- Simplify CUDA installation by using precompiled runtime libraries from NVIDIA
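With the runtime libraries shipped as pip packages, CUDA support no longer requires a locally installed CUDA toolkit; the project README documents `pip install ctransformers[cuda]` as the one-step install.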
0.2.20
Changes
- Add experimental CUDA support for MPT models
0.2.19
Changes
- Add Metal support for LLaMA 2 70B models
- Update llama.cpp
0.2.18
Changes
- Add experimental support for GPTQ models using ExLlama
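A minimal sketch, assuming the GPTQ format is detected from the repo contents as described in the project README; the repo name is illustrative:

```python
from ctransformers import AutoModelForCausalLM

# Illustrative GPTQ checkpoint; ExLlama-backed support is experimental.
llm = AutoModelForCausalLM.from_pretrained("TheBloke/Llama-2-7B-GPTQ")
print(llm("AI is going to", max_new_tokens=64))
```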