Llama CPP

兼容性

仅在 Node.js 上可用。

此模块基于 node-llama-cpp Node.js 绑定，用于 llama.cpp，使您能够使用本地运行的 LLM。这使您可以使用更小的量化模型，该模型能够在笔记本电脑环境中运行，非常适合测试和快速验证想法，而无需产生费用！

设置

您需要安装主要版本 3 的 node-llama-cpp 模块，以便与您的本地模型通信。

提示

请参阅此部分，获取有关安装集成包的通用说明。

npm
Yarn
pnpm

npm install -S node-llama-cpp@3 @langchain/community @langchain/core

yarn add node-llama-cpp@3 @langchain/community @langchain/core

pnpm add node-llama-cpp@3 @langchain/community @langchain/core

您还需要一个本地 Llama 3 模型（或 node-llama-cpp 支持的模型）。您需要将此模型的路径作为参数的一部分传递给 LlamaCpp 模块（请参阅示例）。

开箱即用的 node-llama-cpp 经过调整，可在 MacOS 平台上运行，并支持 Apple M 系列处理器的 Metal GPU。如果您需要关闭此功能或需要支持 CUDA 架构，请参阅 node-llama-cpp 的文档。

有关获取和准备 llama3 的建议，请参阅此模块的 LLM 版本的文档。

LangChain.js 贡献者的注意事项：如果您想运行与此模块关联的测试，您需要将本地模型的路径放在环境变量 LLAMA_PATH 中。

使用

基本使用

在这种情况下，我们传入一个包装为消息的提示，并期望得到响应。

import { ChatLlamaCpp } from "@langchain/community/chat_models/llama_cpp";
import { HumanMessage } from "@langchain/core/messages";

const llamaPath = "/Replace/with/path/to/your/model/gguf-llama3-Q4_0.bin";

const model = await ChatLlamaCpp.initialize({ modelPath: llamaPath });

const response = await model.invoke([
  new HumanMessage({ content: "My name is John." }),
]);
console.log({ response });

/*
  AIMessage {
    lc_serializable: true,
    lc_kwargs: {
      content: 'Hello John.',
      additional_kwargs: {}
    },
    lc_namespace: [ 'langchain', 'schema' ],
    content: 'Hello John.',
    name: undefined,
    additional_kwargs: {}
  }
*/

API 参考

ChatLlamaCpp 来自 @langchain/community/chat_models/llama_cpp
HumanMessage 来自 @langchain/core/messages

系统消息

我们还可以提供系统消息，请注意，使用 llama_cpp 模块，系统消息将导致创建新会话。

import { ChatLlamaCpp } from "@langchain/community/chat_models/llama_cpp";
import { SystemMessage, HumanMessage } from "@langchain/core/messages";

const llamaPath = "/Replace/with/path/to/your/model/gguf-llama3-Q4_0.bin";

const model = await ChatLlamaCpp.initialize({ modelPath: llamaPath });

const response = await model.invoke([
  new SystemMessage(
    "You are a pirate, responses must be very verbose and in pirate dialect, add 'Arr, m'hearty!' to each sentence."
  ),
  new HumanMessage("Tell me where Llamas come from?"),
]);
console.log({ response });

/*
  AIMessage {
    lc_serializable: true,
    lc_kwargs: {
      content: "Arr, m'hearty! Llamas come from the land of Peru.",
      additional_kwargs: {}
    },
    lc_namespace: [ 'langchain', 'schema' ],
    content: "Arr, m'hearty! Llamas come from the land of Peru.",
    name: undefined,
    additional_kwargs: {}
  }
*/

API 参考

ChatLlamaCpp 来自 @langchain/community/chat_models/llama_cpp
SystemMessage 来自 @langchain/core/messages
HumanMessage 来自 @langchain/core/messages

链

此模块也可以与链一起使用，请注意，使用更复杂的链将需要足够强大的 llama3 版本，例如 70B 版本。

import { ChatLlamaCpp } from "@langchain/community/chat_models/llama_cpp";
import { LLMChain } from "langchain/chains";
import { PromptTemplate } from "@langchain/core/prompts";

const llamaPath = "/Replace/with/path/to/your/model/gguf-llama3-Q4_0.bin";

const model = await ChatLlamaCpp.initialize({
  modelPath: llamaPath,
  temperature: 0.5,
});

const prompt = PromptTemplate.fromTemplate(
  "What is a good name for a company that makes {product}?"
);
const chain = new LLMChain({ llm: model, prompt });

const response = await chain.invoke({ product: "colorful socks" });

console.log({ response });

/*
  {
  text: `I'm not sure what you mean by "colorful socks" but here are some ideas:\n` +
    '\n' +
    '- Sock-it to me!\n' +
    '- Socks Away\n' +
    '- Fancy Footwear'
  }
*/

API 参考

ChatLlamaCpp 来自 @langchain/community/chat_models/llama_cpp
LLMChain 来自 langchain/chains
PromptTemplate 来自 @langchain/core/prompts

流式处理

我们还可以使用 Llama CPP 进行流式处理，这可以使用原始的“单提示”字符串

import { ChatLlamaCpp } from "@langchain/community/chat_models/llama_cpp";

const llamaPath = "/Replace/with/path/to/your/model/gguf-llama3-Q4_0.bin";

const model = await ChatLlamaCpp.initialize({
  modelPath: llamaPath,
  temperature: 0.7,
});

const stream = await model.stream("Tell me a short story about a happy Llama.");

for await (const chunk of stream) {
  console.log(chunk.content);
}

/*

  Once
   upon
   a
   time
  ,
   in
   a
   green
   and
   sunny
   field
  ...
*/

API 参考

ChatLlamaCpp 来自 @langchain/community/chat_models/llama_cpp

或者您可以提供多条消息，请注意，这会获取输入，然后向模型提交 Llama3 格式的提示。

import { ChatLlamaCpp } from "@langchain/community/chat_models/llama_cpp";
import { SystemMessage, HumanMessage } from "@langchain/core/messages";

const llamaPath = "/Replace/with/path/to/your/model/gguf-llama3-Q4_0.bin";

const llamaCpp = await ChatLlamaCpp.initialize({
  modelPath: llamaPath,
  temperature: 0.7,
});

const stream = await llamaCpp.stream([
  new SystemMessage(
    "You are a pirate, responses must be very verbose and in pirate dialect."
  ),
  new HumanMessage("Tell me about Llamas?"),
]);

for await (const chunk of stream) {
  console.log(chunk.content);
}

/*

  Ar
  rr
  r
  ,
   me
   heart
  y
  !

   Ye
   be
   ask
  in
  '
   about
   llam
  as
  ,
   e
  h
  ?
  ...
*/

API 参考

ChatLlamaCpp 来自 @langchain/community/chat_models/llama_cpp
SystemMessage 来自 @langchain/core/messages
HumanMessage 来自 @langchain/core/messages

使用 invoke 方法，我们还可以实现流式生成，并使用 signal 中止生成。

import { ChatLlamaCpp } from "@langchain/community/chat_models/llama_cpp";
import { SystemMessage, HumanMessage } from "@langchain/core/messages";

const llamaPath = "/Replace/with/path/to/your/model/gguf-llama3-Q4_0.bin";

const model = await ChatLlamaCpp.initialize({
  modelPath: llamaPath,
  temperature: 0.7,
});

const controller = new AbortController();

setTimeout(() => {
  controller.abort();
  console.log("Aborted");
}, 5000);

await model.invoke(
  [
    new SystemMessage(
      "You are a pirate, responses must be very verbose and in pirate dialect."
    ),
    new HumanMessage("Tell me about Llamas?"),
  ],
  {
    signal: controller.signal,
    callbacks: [
      {
        handleLLMNewToken(token) {
          console.log(token);
        },
      },
    ],
  }
);
/*

  Once
   upon
   a
   time
  ,
   in
   a
   green
   and
   sunny
   field
  ...
  Aborted

  AbortError

*/

API 参考

ChatLlamaCpp 来自 @langchain/community/chat_models/llama_cpp
SystemMessage 来自 @langchain/core/messages
HumanMessage 来自 @langchain/core/messages

聊天模型概念指南
聊天模型操作指南

Llama CPP

设置

使用

基本使用

API 参考

系统消息

API 参考

链

API 参考

流式处理

API 参考

API 参考

API 参考

此页面是否对您有帮助？

您也可以留下详细的反馈在 GitHub 上.

Llama CPP

设置​

使用​

基本使用​

API 参考

系统消息​

API 参考

链​

API 参考

流式处理​

API 参考

API 参考

API 参考

相关​

此页面是否对您有帮助？

您也可以留下详细的反馈 在 GitHub 上.

设置

使用

基本使用

系统消息

链

流式处理

相关

您也可以留下详细的反馈在 GitHub 上.