[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[GNU ELPA] Llm version 0.17.4
From: |
ELPA update |
Subject: |
[GNU ELPA] Llm version 0.17.4 |
Date: |
Sun, 08 Sep 2024 05:04:29 -0400 |
Version 0.17.4 of package Llm has just been released in GNU ELPA.
You can now find it in M-x list-packages RET.
Llm describes itself as:
===================================
Interface to pluggable llm backends
===================================
More at https://elpa.gnu.org/packages/llm.html
## Summary:
━━━━━━━━━━━━━━━━━━━━━━━
LLM PACKAGE FOR EMACS
━━━━━━━━━━━━━━━━━━━━━━━
1 Introduction
══════════════
This library provides an interface for interacting with Large Language
Models (LLMs). It allows elisp code to use LLMs while also giving
end-users the choice to select their preferred LLM. This is
particularly beneficial when working with LLMs since various
high-quality models exist, some of which have paid API access, while
others are locally installed and free but offer medium
quality. Applications using LLMs can utilize this library to ensure
compatibility regardless of whether the user has a local LLM or is
paying for API access.
## Recent NEWS:
1 Version 0.17.4
════════════════
• Fix problem with Open AI's `llm-chat-token-limit'.
• Fix Open AI and Gemini's parallel function calling.
• Add variable `llm-prompt-default-max-tokens' to put a cap on number
of tokens regardless of model size.
2 Version 0.17.3
════════════════
• More fixes with Claude and Ollama function calling conversation,
thanks to Paul Nelson.
• Make `llm-chat-streaming-to-point' more efficient, just inserting
new text, thanks to Paul Nelson.
• Don't output streaming information when `llm-debug' is true, since
it tended to be overwhelming.
3 Version 0.17.2
════════════════
• Fix compiled functions not being evaluated in `llm-prompt'.
• Use Ollama's new `embed' API instead of the obsolete one.
• Fix Claude function calling conversations
• Fix issue in Open AI streaming function calling.
• Update Open AI and Claude default chat models to the later models.
4 Version 0.17.1
════════════════
• Support Ollama function calling, for models which support it.
• Make sure every model, even unknown models, return some value for
`llm-chat-token-limit'.
• Add token count for llama3.1 model.
• Make `llm-capabilities' work model-by-model for embeddings and
functions
5 Version 0.17.0
════════════════
• Introduced `llm-prompt' for prompt management and creation from
generators.
• Removed Gemini and Vertex token counting, because `llm-prompt' uses
token counting often and it's best to have a quick estimate than a
more expensive more accurate count.
6 Version 0.16.2
════════════════
• Fix Open AI's gpt4-o context length, which is lower for most paying
users than the max.
7 Version 0.16.1
════════════════
• Add support for HTTP / HTTPS proxies.
8 Version 0.16.0
════════════════
• Add "non-standard params" to set per-provider options.
• Add default parameters for chat providers.
9 Version 0.15.0
════════════════
• Move to `plz' backend, which uses `curl'. This helps move this
package to a stronger foundation backed by parsing to spec. Thanks
to Roman Scherer for contributing the `plz' extensions that enable
this, which are currently bundled in this package but will
eventually become their own separate package.
• Add model context information for Open AI's GPT 4-o.
• Add model context information for Gemini's 1.5 models.
10 Version 0.14.2
═════════════════
• Fix mangled copyright line (needed to get ELPA version unstuck).
• Fix Vertex response handling bug.
11 Version 0.14.1
═════════════════
• Fix various issues with the 0.14 release
12 Version 0.14
═══════════════
• Introduce new way of creating prompts: llm-make-chat-prompt,
deprecating the older ways.
• Improve Vertex error handling
13 Version 0.13
═══════════════
• Add Claude's new support for function calling.
• Refactor of providers to centralize embedding and chat logic.
• Remove connection buffers after use.
• Fixes to provider more specific error messages for most providers.
14 Verson 0.12.3
════════════════
• Refactor of warn-non-nonfree methods.
• Add non-free warnings for Gemini and Claude.
15 Version 0.12.2
═════════════════
• Send connection issues to error callbacks, and fix an error handling
issue in Ollama.
• Fix issue where, in some cases, streaming does not work the first
time attempted.
16 Version 0.12.1
═════════════════
• Fix issue in `llm-ollama' with not using provider host for sync
embeddings.
• Fix issue in `llm-openai' where were incompatible with some Open
AI-compatible backends due to assumptions about inconsequential JSON
details.
17 Version 0.12.0
═════════════════
• Add provider `llm-claude', for Anthropic's Claude.
18 Version 0.11.0
═════════════════
• Introduce function calling, now available only in Open AI and
Gemini.
• Introduce `llm-capabilities', which returns a list of extra
capabilities for each backend.
• Fix issue with logging when we weren't supposed to.
19 Version 0.10.0
═════════════════
• Introduce llm logging (for help with developing against `llm'), set
`llm-log' to non-nil to enable logging of all interactions with the
`llm' package.
• Change the default interaction with ollama to one more suited for
converesations (thanks to Thomas Allen).
20 Version 0.9.1
════════════════
• Default to the new "text-embedding-3-small" model for Open AI.
*Important*: Anyone who has stored embeddings should either
regenerate embeddings (recommended) or hard-code the old embedding
model ("text-embedding-ada-002").
… …
[Prev in Thread] |
Current Thread |
[Next in Thread] |
- [GNU ELPA] Llm version 0.17.4,
ELPA update <=