- Moved cost calculation into the `@modelfusion/cost-calculation` package. Thanks @jakedetels for the refactoring!
- `FileCache` for caching responses to disk. Thanks @jakedetels for the feature! Example:

  ```ts
  import { generateText, openai } from "modelfusion";
  import { FileCache } from "modelfusion/node";

  const cache = new FileCache();

  const text1 = await generateText({
    model: openai
      .ChatTextGenerator({ model: "gpt-3.5-turbo", temperature: 1 })
      .withTextPrompt(),
    prompt: "Write a short story about a robot learning to love",
    logging: "basic-text",
    cache,
  });

  console.log({ text1 });

  const text2 = await generateText({
    model: openai
      .ChatTextGenerator({ model: "gpt-3.5-turbo", temperature: 1 })
      .withTextPrompt(),
    prompt: "Write a short story about a robot learning to love",
    logging: "basic-text",
    cache,
  });

  console.log({ text2 }); // same text
  ```
- Try both dynamic imports and `require` for loading libraries on demand.
- `ObjectGeneratorTool`: a tool to create synthetic or fictional structured data using `generateObject`.
- `jsonToolCallPrompt.instruction()`: create an instruction prompt for tool calls that uses JSON.
- `jsonToolCallPrompt` automatically enables JSON mode or grammars when supported by the model.
- Added prompt function support to `generateText`, `streamText`, `generateObject`, and `streamObject`. You can create prompt functions for text, instruction, and chat prompts using `createTextPrompt`, `createInstructionPrompt`, and `createChatPrompt`. Prompt functions allow you to load prompts from external sources and improve prompt logging. Example:

  ```ts
  const storyPrompt = createInstructionPrompt(
    async ({ protagonist }: { protagonist: string }) => ({
      system: "You are an award-winning author.",
      instruction: `Write a short story about ${protagonist} learning to love.`,
    })
  );

  const text = await generateText({
    model: openai
      .ChatTextGenerator({ model: "gpt-3.5-turbo" })
      .withInstructionPrompt(),
    prompt: storyPrompt({
      protagonist: "a robot",
    }),
  });
  ```
- Refactored the build to use `tsup`.
- Support for OpenAI embedding custom dimensions.
- breaking change: renamed the `embeddingDimensions` setting to `dimensions`.
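For illustration, a minimal sketch of the renamed setting (the model and dimension count are illustrative):

```ts
import { embed, openai } from "modelfusion";

const embedding = await embed({
  model: openai.TextEmbedder({
    model: "text-embedding-3-small",
    dimensions: 512, // custom embedding dimensions (was `embeddingDimensions`)
  }),
  value: "At first, Nox didn't know what to do with the pup.",
});
```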
- Support for the OpenAI `text-embedding-3-small` and `text-embedding-3-large` embedding models.
- Support for the OpenAI `gpt-4-turbo-preview`, `gpt-4-0125-preview`, and `gpt-3.5-turbo-0125` chat models.
- Add `type-fest` as a dependency to fix type inference errors.
- `ObjectStreamResponse` and `ObjectStreamFromResponse` serialization functions for using server-generated object streams in web applications.

  Server example:

  ```ts
  export async function POST(req: Request) {
    const { myArgs } = await req.json();

    const objectStream = await streamObject({
      // ...
    });

    // serialize the object stream to a response:
    return new ObjectStreamResponse(objectStream);
  }
  ```

  Client example:

  ```ts
  const response = await fetch("/api/stream-object-openai", {
    method: "POST",
    body: JSON.stringify({ myArgs }),
  });

  // deserialize (result object is simpler than the full response)
  const stream = ObjectStreamFromResponse({
    schema: itinerarySchema,
    response,
  });

  for await (const { partialObject } of stream) {
    // do something, e.g. setting a React state
  }
  ```
- breaking change: rename `generateStructure` to `generateObject` and `streamStructure` to `streamObject`. Related names have been changed accordingly.
- breaking change: the `streamObject` result stream contains additional data. You need to use `stream.partialObject` or destructuring to access it:

  ```ts
  const objectStream = await streamObject({
    // ...
  });

  for await (const { partialObject } of objectStream) {
    console.clear();
    console.log(partialObject);
  }
  ```
- breaking change: the result from successful `Schema` validations is stored in the `value` property (before: `data`).
- Duplex speech streaming works in Vercel Edge Functions.
- breaking change: updated the `generateTranscription` interface. The function now takes a `mimeType` and `audioData` (base64-encoded string, `Uint8Array`, `Buffer`, or `ArrayBuffer`). Example:

  ```ts
  import { generateTranscription, openai } from "modelfusion";
  import fs from "node:fs";

  const transcription = await generateTranscription({
    model: openai.Transcriber({ model: "whisper-1" }),
    mimeType: "audio/mp3",
    audioData: await fs.promises.readFile("data/test.mp3"),
  });
  ```
- Images in instruction and chat prompts can be `Buffer` or `ArrayBuffer` instances (in addition to base64-encoded strings and `Uint8Array` instances).
- breaking change: Usage of Node `async_hooks` has been renamed from `node:async_hooks` to `async_hooks` for easier Webpack configuration. To exclude `async_hooks` from client-side bundling, you can use the following config for Next.js (`next.config.mjs` or `next.config.js`):

  ```js
  /**
   * @type {import('next').NextConfig}
   */
  const nextConfig = {
    webpack: (config, { isServer }) => {
      if (isServer) {
        return config;
      }

      config.resolve = config.resolve ?? {};
      config.resolve.fallback = config.resolve.fallback ?? {};

      // async hooks is not available in the browser:
      config.resolve.fallback.async_hooks = false;

      return config;
    },
  };
  ```
- breaking change: ModelFusion uses `Uint8Array` instead of `Buffer` for better cross-platform compatibility (see also "Goodbye, Node.js Buffer"). This can lead to breaking changes in your code if you use `Buffer`-specific methods.
- breaking change: Image content in multi-modal instruction and chat inputs (e.g. for GPT Vision) is passed in the `image` property (instead of `base64Image`) and supports both base64 strings and `Uint8Array` inputs:

  ```ts
  const image = fs.readFileSync(path.join("data", "example-image.png"));

  const textStream = await streamText({
    model: openai.ChatTextGenerator({
      model: "gpt-4-vision-preview",
      maxGenerationTokens: 1000,
    }),
    prompt: [
      openai.ChatMessage.user([
        { type: "text", text: "Describe the image in detail:\n\n" },
        { type: "image", image, mimeType: "image/png" },
      ]),
    ],
  });
  ```
- OpenAI-compatible providers with predefined API configurations have a customized provider name that shows up in the events.
- breaking change: `streamStructure` returns an async iterable over deep partial objects. If you need to get the fully validated final result, you can use the `fullResponse: true` option and await the `structurePromise` value. Example:

  ```ts
  const { structureStream, structurePromise } = await streamStructure({
    model: ollama
      .ChatTextGenerator({
        model: "openhermes2.5-mistral",
        maxGenerationTokens: 1024,
        temperature: 0,
      })
      .asStructureGenerationModel(jsonStructurePrompt.text()),

    schema: zodSchema(
      z.object({
        characters: z.array(
          z.object({
            name: z.string(),
            class: z
              .string()
              .describe("Character class, e.g. warrior, mage, or thief."),
            description: z.string(),
          })
        ),
      })
    ),

    prompt:
      "Generate 3 character descriptions for a fantasy role playing game.",

    fullResponse: true,
  });

  for await (const partialStructure of structureStream) {
    console.clear();
    console.log(partialStructure);
  }

  const structure = await structurePromise;

  console.clear();
  console.log("FINAL STRUCTURE");
  console.log(structure);
  ```
- breaking change: Renamed the `text` value in `streamText` with `fullResponse: true` to `textPromise`.
- Ollama streaming.
- Ollama structure generation and streaming.
- breaking change: rename `useTool` to `runTool` and `useTools` to `runTools` to avoid confusion with React hooks.
- Perplexity AI chat completion support. Example:

  ```ts
  import { openaicompatible, streamText } from "modelfusion";

  const textStream = await streamText({
    model: openaicompatible
      .ChatTextGenerator({
        api: openaicompatible.PerplexityApi(),
        provider: "openaicompatible-perplexity",
        model: "pplx-70b-online", // online model with access to web search
        maxGenerationTokens: 500,
      })
      .withTextPrompt(),

    prompt: "What is RAG in AI?",
  });
  ```
- Embedding support for OpenAI-compatible providers. You can, for example, use the Together AI embedding endpoint:

  ```ts
  import { embed, openaicompatible } from "modelfusion";

  const embedding = await embed({
    model: openaicompatible.TextEmbedder({
      api: openaicompatible.TogetherAIApi(),
      provider: "openaicompatible-togetherai",
      model: "togethercomputer/m2-bert-80M-8k-retrieval",
    }),
    value: "At first, Nox didn't know what to do with the pup.",
  });
  ```
- `classify` model function (docs) for classifying values. The `SemanticClassifier` has been renamed to `EmbeddingSimilarityClassifier` and can be used in conjunction with `classify`:

  ```ts
  import { classify, EmbeddingSimilarityClassifier, openai } from "modelfusion";

  const classifier = new EmbeddingSimilarityClassifier({
    embeddingModel: openai.TextEmbedder({ model: "text-embedding-ada-002" }),
    similarityThreshold: 0.82,
    clusters: [
      {
        name: "politics" as const,
        values: [
          "they will save the country!",
          // ...
        ],
      },
      {
        name: "chitchat" as const,
        values: [
          "how's the weather today?",
          // ...
        ],
      },
    ],
  });

  // strongly typed result:
  const result = await classify({
    model: classifier,
    value: "don't you love politics?",
  });
  ```
- breaking change: Switched from positional parameters to named parameters (a parameter object) for all model and tool functions. The parameter object is the first and only parameter of each function. Additional options (previously the last parameter) are now part of the parameter object. Example:

  ```ts
  // old:
  const text = await generateText(
    openai
      .ChatTextGenerator({
        model: "gpt-3.5-turbo",
        maxGenerationTokens: 1000,
      })
      .withTextPrompt(),
    "Write a short story about a robot learning to love",
    {
      functionId: "example-function",
    }
  );

  // new:
  const text = await generateText({
    model: openai
      .ChatTextGenerator({
        model: "gpt-3.5-turbo",
        maxGenerationTokens: 1000,
      })
      .withTextPrompt(),
    prompt: "Write a short story about a robot learning to love",
    functionId: "example-function",
  });
  ```

  This change was made to make the API more flexible and to allow for future extensions.
- Ollama response schema for repeated calls with Ollama 0.1.19 completion models. Thanks @Necmttn for the bugfix!
- Ollama response schema for repeated calls with Ollama 0.1.19 chat models. Thanks @jakedetels for the bug report!
- Synthia prompt template
- breaking change: Renamed the `parentCallId` function parameter to `callId` to enable options pass-through.
- Better output filtering for the `detailed-object` log format (e.g. via `modelfusion.setLogFormat("detailed-object")`).
- `OllamaCompletionModel` supports setting the prompt template in the settings. Prompt formats are available under `ollama.prompt.*`. You can then call `.withTextPrompt()`, `.withInstructionPrompt()` or `.withChatPrompt()` to use a standardized prompt:

  ```ts
  const model = ollama
    .CompletionTextGenerator({
      model: "mistral",
      promptTemplate: ollama.prompt.Mistral,
      raw: true, // required when using custom prompt template
      maxGenerationTokens: 120,
    })
    .withTextPrompt();
  ```
- breaking change: removed `.withTextPromptTemplate` on `OllamaCompletionModel`.
- Incorrect export. Thanks @mloenow for the fix!
- Schema-specific GBNF grammar generator for `LlamaCppCompletionModel`. When using `jsonStructurePrompt`, it automatically uses a GBNF grammar for the JSON schema that you provide. Example:

  ```ts
  const structure = await generateStructure(
    llamacpp
      .CompletionTextGenerator({
        // run openhermes-2.5-mistral-7b.Q4_K_M.gguf in llama.cpp
        promptTemplate: llamacpp.prompt.ChatML,
        maxGenerationTokens: 1024,
        temperature: 0,
      })
      // automatically restrict the output to your schema using GBNF:
      .asStructureGenerationModel(jsonStructurePrompt.text()),

    zodSchema(
      z.array(
        z.object({
          name: z.string(),
          class: z
            .string()
            .describe("Character class, e.g. warrior, mage, or thief."),
          description: z.string(),
        })
      )
    ),

    "Generate 3 character descriptions for a fantasy role playing game. "
  );
  ```
- `LlamaCppCompletionModel` supports setting the prompt template in the settings. Prompt formats are available under `llamacpp.prompt.*`. You can then call `.withTextPrompt()`, `.withInstructionPrompt()` or `.withChatPrompt()` to use a standardized prompt:

  ```ts
  const model = llamacpp
    .CompletionTextGenerator({
      // run https://huggingface.co/TheBloke/OpenHermes-2.5-Mistral-7B-GGUF with llama.cpp
      promptTemplate: llamacpp.prompt.ChatML,
      contextWindowSize: 4096,
      maxGenerationTokens: 512,
    })
    .withChatPrompt();
  ```
- breaking change: renamed `response` to `rawResponse` when using the `fullResponse: true` setting.
- breaking change: renamed `llamacpp.TextGenerator` to `llamacpp.CompletionTextGenerator`.
- breaking change: removed `.withTextPromptTemplate` on `LlamaCppCompletionModel`.
- Predefined Llama.cpp GBNF grammars:
  - `llamacpp.grammar.json`: restricts the output to JSON.
  - `llamacpp.grammar.jsonArray`: restricts the output to a JSON array.
  - `llamacpp.grammar.list`: restricts the output to a newline-separated list where each line starts with `-`.
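For example, a predefined grammar can be passed into the `grammar` setting of a Llama.cpp text generator (a minimal sketch; the other settings are illustrative):

```ts
import { generateText, llamacpp } from "modelfusion";

const text = await generateText(
  llamacpp.TextGenerator({
    maxGenerationTokens: 256,
    temperature: 0,
    // restrict the output to a newline-separated list:
    grammar: llamacpp.grammar.list,
  }),
  "List 5 ingredients for a lasagna:\n\n"
);
```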
- Llama.cpp structure generation support:

  ```ts
  const structure = await generateStructure(
    llamacpp
      .TextGenerator({
        // run openhermes-2.5-mistral-7b.Q4_K_M.gguf in llama.cpp
        maxGenerationTokens: 1024,
        temperature: 0,
      })
      .withTextPromptTemplate(ChatMLPrompt.instruction()) // needed for jsonStructurePrompt.text()
      .asStructureGenerationModel(jsonStructurePrompt.text()), // automatically restrict the output to JSON

    zodSchema(
      z.object({
        characters: z.array(
          z.object({
            name: z.string(),
            class: z
              .string()
              .describe("Character class, e.g. warrior, mage, or thief."),
            description: z.string(),
          })
        ),
      })
    ),

    "Generate 3 character descriptions for a fantasy role playing game. "
  );
  ```
- Semantic classifier: an easy way to determine the class of a text using embeddings. Example:

  ```ts
  import { SemanticClassifier, openai } from "modelfusion";

  const classifier = new SemanticClassifier({
    embeddingModel: openai.TextEmbedder({
      model: "text-embedding-ada-002",
    }),
    similarityThreshold: 0.82,
    clusters: [
      {
        name: "politics" as const,
        values: [
          "isn't politics the best thing ever",
          "why don't you tell me about your political opinions",
          "don't you just love the president",
          "don't you just hate the president",
          "they're going to destroy this country!",
          "they will save the country!",
        ],
      },
      {
        name: "chitchat" as const,
        values: [
          "how's the weather today?",
          "how are things going?",
          "lovely weather today",
          "the weather is horrendous",
          "let's go to the chippy",
        ],
      },
    ],
  });

  console.log(await classifier.classify("don't you love politics?")); // politics
  console.log(await classifier.classify("how's the weather today?")); // chitchat
  console.log(
    await classifier.classify("I'm interested in learning about llama 2")
  ); // null
  ```
- Removed Anthropic support. Anthropic has a strong stance against open-source models and against non-US AI. I will not support them by providing a ModelFusion integration.
- Together AI text generation and text streaming using OpenAI-compatible chat models.
- Custom call header support for APIs. You can pass a `customCallHeaders` function into API configurations to add custom headers. The function is called with `functionType`, `functionId`, `run`, and `callId` parameters. Example for Helicone:

  ```ts
  const text = await generateText(
    openai
      .ChatTextGenerator({
        api: new HeliconeOpenAIApiConfiguration({
          customCallHeaders: ({ functionId, callId }) => ({
            "Helicone-Property-FunctionId": functionId,
            "Helicone-Property-CallId": callId,
          }),
        }),
        model: "gpt-3.5-turbo",
        temperature: 0.7,
        maxGenerationTokens: 500,
      })
      .withTextPrompt(),

    "Write a short story about a robot learning to love",
    { functionId: "example-function" }
  );
  ```
- Rudimentary caching support for `generateText`. You can use a `MemoryCache` to store the response of a `generateText` call. Example:

  ```ts
  import { MemoryCache, generateText, ollama } from "modelfusion";

  const model = ollama
    .ChatTextGenerator({ model: "llama2:chat", maxGenerationTokens: 100 })
    .withTextPrompt();

  const cache = new MemoryCache();

  const text1 = await generateText(
    model,
    "Write a short story about a robot learning to love:",
    { cache }
  );

  console.log(text1);

  // 2nd call will use cached response:
  const text2 = await generateText(
    model,
    "Write a short story about a robot learning to love:", // same text
    { cache }
  );

  console.log(text2);
  ```
- `validateTypes` and `safeValidateTypes` helpers that perform type checking of an object against a `Schema` (e.g., a `zodSchema`).
Structure generation improvements:

- Added `.asStructureGenerationModel(...)` function to `OpenAIChatModel` and `OllamaChatModel` to create structure generation models from chat models.
- Added `jsonStructurePrompt` helper function to create structure generation models.

Example:

```ts
import {
generateStructure,
jsonStructurePrompt,
ollama,
zodSchema,
} from "modelfusion";
const structure = await generateStructure(
ollama
.ChatTextGenerator({
model: "openhermes2.5-mistral",
maxGenerationTokens: 1024,
temperature: 0,
})
.asStructureGenerationModel(jsonStructurePrompt.text()),
zodSchema(
z.object({
characters: z.array(
z.object({
name: z.string(),
class: z
.string()
.describe("Character class, e.g. warrior, mage, or thief."),
description: z.string(),
})
),
})
),
"Generate 3 character descriptions for a fantasy role playing game. "
);
```
- breaking change: renamed `useToolsOrGenerateText` to `useTools`.
- breaking change: renamed `generateToolCallsOrText` to `generateToolCalls`.
- Removed the restriction on tool names. OpenAI tool calls do not have such a restriction.
Reworked API configuration support:

- All providers now have an `Api` function that you can call to create custom API configurations. The base URL setup is more flexible and allows you to override parts of the base URL selectively.
- `api` namespace with retry and throttle configurations.
- Updated Cohere models.
- Updated LMNT API calls to the LMNT `v1` API.
- breaking change: Renamed `throttleUnlimitedConcurrency` to `throttleOff`.
- breaking change: renamed `modelfusion/extension` to `modelfusion/internal`. This requires updating `modelfusion-experimental` (if used) to `v0.3.0`.
- Removed deprecated OpenAI completion models that will be deactivated on January 4, 2024.
- OpenAI-compatible completion model. It works, e.g., with Fireworks AI.
- Together AI API configuration (for OpenAI-compatible chat models):

  ```ts
  import {
    TogetherAIApiConfiguration,
    openaicompatible,
    streamText,
  } from "modelfusion";

  const textStream = await streamText(
    openaicompatible
      .ChatTextGenerator({
        api: new TogetherAIApiConfiguration(),
        model: "mistralai/Mixtral-8x7B-Instruct-v0.1",
      })
      .withTextPrompt(),

    "Write a story about a robot learning to love"
  );
  ```
- Updated Llama.cpp model settings. GBNF grammars can be passed into the `grammar` setting:

  ```ts
  const text = await generateText(
    llamacpp
      .TextGenerator({
        maxGenerationTokens: 512,
        temperature: 0,
        // simple list grammar:
        grammar: `root ::= ("- " item)+
  item ::= [^\\n]+ "\\n"`,
      })
      .withTextPromptTemplate(MistralInstructPrompt.text()),

    "List 5 ingredients for a lasagna:\n\n"
  );
  ```
- Mistral instruct prompt template
- breaking change: Renamed `LlamaCppTextGenerationModel` to `LlamaCppCompletionModel`.
- Updated `LlamaCppCompletionModel` to the latest llama.cpp version.
- Fixed formatting of the system prompt for chats in the Llama 2 prompt template.
Experimental features that are unlikely to become stable before v1.0 have been moved to a separate `modelfusion-experimental` package:

- Cost calculation
- `guard` function
- Browser and server features (incl. flow)
- `summarizeRecursively` function
- Tool call support for chat prompts. Assistant messages can contain tool calls, and tool messages can contain tool call results. Tool calls can be used to implement e.g. agents:

  ```ts
  const chat: ChatPrompt = {
    system: "You are ...",
    messages: [ChatMessage.user({ text: instruction })],
  };

  while (true) {
    const { text, toolResults } = await useToolsOrGenerateText(
      openai
        .ChatTextGenerator({ model: "gpt-4-1106-preview" })
        .withChatPrompt(),
      tools, // array of tools
      chat
    );

    // add the assistant and tool messages to the chat:
    chat.messages.push(
      ChatMessage.assistant({ text, toolResults }),
      ChatMessage.tool({ toolResults })
    );

    if (toolResults == null) {
      return; // no more actions, break loop
    }

    // ... (handle tool results)
  }
  ```
- `streamText` returns a `text` promise when invoked with `fullResponse: true`. After the streaming has finished, the promise resolves with the full text:

  ```ts
  const { text, textStream } = await streamText(
    openai.ChatTextGenerator({ model: "gpt-3.5-turbo" }).withTextPrompt(),
    "Write a short story about a robot learning to love:",
    { fullResponse: true }
  );

  // ... (handle streaming)

  console.log(await text); // full text
  ```
- breaking change: Unified text and multimodal prompt templates. `[Text/MultiModal]InstructionPrompt` is now `InstructionPrompt`, and `[Text/MultiModal]ChatPrompt` is now `ChatPrompt`.
- More flexible chat prompts: The chat prompt validation is now chat-template specific and validated at runtime. E.g. the Llama 2 prompt template only supports turns of user and assistant messages, whereas other formats are more flexible.
- `finishReason` support for `generateText`.

  The finish reason can be `stop` (the model generated a stop sequence), `length` (the model generated the maximum number of tokens), `content-filter` (the content filter detected a violation), `tool-calls` (the model triggered a tool call), `error` (the model stopped because of an error), `other` (the model stopped for another reason), or `unknown` (the stop reason is not known or the model does not support finish reasons).

  You can extract it from the full response when using `fullResponse: true`:

  ```ts
  const { text, finishReason } = await generateText(
    openai
      .ChatTextGenerator({ model: "gpt-3.5-turbo", maxGenerationTokens: 200 })
      .withTextPrompt(),
    "Write a short story about a robot learning to love:",
    { fullResponse: true }
  );
  ```
- You can specify `numberOfGenerations` on image generation models and create multiple images by using the `fullResponse: true` option. Example:

  ```ts
  // generate 2 images:
  const { images } = await generateImage(
    openai.ImageGenerator({
      model: "dall-e-3",
      numberOfGenerations: 2,
      size: "1024x1024",
    }),
    "the wicked witch of the west in the style of early 19th century painting",
    { fullResponse: true }
  );
  ```
- breaking change: Image generation models use a generalized `numberOfGenerations` parameter (instead of model-specific parameters) to specify the number of generations.
- Automatic1111 Stable Diffusion Web UI configuration has separate configuration of host, port, and path.
- Automatic1111 Stable Diffusion Web UI uses negative prompt and seed.
- `ollama.ChatTextGenerator` model that calls the Ollama chat API.
- Ollama chat messages and prompts are exposed through `ollama.ChatMessage` and `ollama.ChatPrompt`.
- OpenAI chat messages and prompts are exposed through `openai.ChatMessage` and `openai.ChatPrompt`.
- Mistral chat messages and prompts are exposed through `mistral.ChatMessage` and `mistral.ChatPrompt`.
- breaking change: renamed `ollama.TextGenerator` to `ollama.CompletionTextGenerator`.
- breaking change: renamed `mistral.TextGenerator` to `mistral.ChatTextGenerator`.
- You can specify `numberOfGenerations` on text generation models and access multiple generations by using the `fullResponse: true` option. Example:

  ```ts
  // generate 2 texts:
  const { texts } = await generateText(
    openai.CompletionTextGenerator({
      model: "gpt-3.5-turbo-instruct",
      numberOfGenerations: 2,
      maxGenerationTokens: 1000,
    }),
    "Write a short story about a robot learning to love:\n\n",
    { fullResponse: true }
  );
  ```
- breaking change: Text generation models use a generalized `numberOfGenerations` parameter (instead of model-specific parameters) to specify the number of generations.
- breaking change: Renamed the `maxCompletionTokens` text generation model setting to `maxGenerationTokens`.
- breaking change: The `responseType` option was changed into the `fullResponse` option and uses a boolean value to make discovery easy. The response values from the full response have been renamed for clarity. For base64 image generation, you can use the `imageBase64` value from the full response:

  ```ts
  const { imageBase64 } = await generateImage(model, prompt, {
    fullResponse: true,
  });
  ```
- Better docs for the OpenAI chat settings. Thanks @bearjaws for the contribution!
- Streaming OpenAI chat text generation when setting `n: 2` or higher returns only the stream from the first choice.
- breaking change: Ollama image (vision) support. This changes the Ollama prompt format. You can add `.withTextPrompt()` to existing Ollama text generators to get a text prompt like before.

  Vision example:

  ```ts
  import { ollama, streamText } from "modelfusion";

  const textStream = await streamText(
    ollama.TextGenerator({
      model: "bakllava",
      maxCompletionTokens: 1024,
      temperature: 0,
    }),
    {
      prompt: "Describe the image in detail",
      images: [image], // base-64 encoded png or jpeg
    }
  );
  ```
- breaking change: Switch Ollama settings to camelCase to align with the rest of the library.
- `cachePrompt` parameter for llama.cpp models. Thanks @djwhitt for the contribution!
- Prompt template for neural-chat models.
- Optional response prefix for instruction prompts to guide the LLM response.
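A minimal sketch of the prompt shape; `responsePrefix` as the property name is an assumption here:

```ts
// an instruction prompt whose response prefix steers the model output:
const prompt = {
  system: "You respond with valid JSON only.",
  instruction: "List three primary colors.",
  responsePrefix: '{"colors": ["', // assumed property name; the model continues from here
};
```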
- breaking change: Renamed prompt format to prompt template to align with the commonly used language (e.g. from model cards).
- Improved Ollama error handling.
- breaking change: Setting global function observers and global logging has changed. You can call methods on a `modelfusion` import:

  ```ts
  import { modelfusion } from "modelfusion";

  modelfusion.setLogFormat("basic-text");
  ```
- Cleaned output when using the `detailed-object` log format.
- Whisper.cpp transcription (speech-to-text) model support:

  ```ts
  import { generateTranscription, whispercpp } from "modelfusion";

  const data = await fs.promises.readFile("data/test.wav");

  const transcription = await generateTranscription(whispercpp.Transcriber(), {
    type: "wav",
    data,
  });
  ```
- Better error reporting.
- Temperature and language settings to OpenAI transcription model.
- `maxValuesPerCall` setting for `OpenAITextEmbeddingModel` to enable different configurations, e.g. for Azure. Thanks @nanotronic for the contribution!
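A minimal sketch (the batch size is illustrative):

```ts
import { embedMany, openai } from "modelfusion";

const embeddings = await embedMany(
  openai.TextEmbedder({
    model: "text-embedding-ada-002",
    maxValuesPerCall: 5, // e.g. Azure limits the number of values per call
  }),
  ["first text", "second text", "third text"]
);
```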
- Multi-modal chat prompts. Supported by OpenAI vision chat models and by BakLLaVA prompt format.
- breaking change: renamed `ChatPrompt` to `TextChatPrompt` to distinguish it from multi-modal chat prompts.
- experimental: `modelfusion/extension` export with functions and classes that are necessary to implement providers in 3rd-party node modules. See lgrammel/modelfusion-example-provider for an example.
- `OpenAIChatMessage` function call support.
- Support for OpenAI-compatible chat APIs. See OpenAI Compatible for details.

  ```ts
  import {
    BaseUrlApiConfiguration,
    openaicompatible,
    generateText,
  } from "modelfusion";

  const text = await generateText(
    openaicompatible
      .ChatTextGenerator({
        api: new BaseUrlApiConfiguration({
          baseUrl: "https://api.fireworks.ai/inference/v1",
          headers: {
            Authorization: `Bearer ${process.env.FIREWORKS_API_KEY}`,
          },
        }),
        model: "accounts/fireworks/models/mistral-7b",
      })
      .withTextPrompt(),

    "Write a story about a robot learning to love"
  );
  ```
- Introduce `uncheckedSchema()` facade function as an easier way to create unchecked ModelFusion schemas. This aligns the API with `zodSchema()`.
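A minimal sketch, assuming `uncheckedSchema` takes a plain JSON schema object (the schema content is illustrative):

```ts
import { uncheckedSchema } from "modelfusion";

// no validation or type inference is performed on values:
const itinerarySchema = uncheckedSchema({
  type: "object",
  properties: {
    destination: { type: "string" },
    days: { type: "number" },
  },
});
```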
- breaking change: Renamed the `InstructionPrompt` interface to `MultiModalInstructionPrompt` to clearly distinguish it from `TextInstructionPrompt`.
- breaking change: Renamed `.withBasicPrompt` methods for image generation models to `.withTextPrompt` to align with text generation models.
- Introduce `zodSchema()` facade function as an easier way to create new ModelFusion Zod schemas. This clearly distinguishes it from `ZodSchema`, which is also part of the zod library.
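A minimal sketch:

```ts
import { zodSchema } from "modelfusion";
import { z } from "zod";

// wrap a Zod schema as a ModelFusion schema:
const characterSchema = zodSchema(
  z.object({
    name: z.string(),
    class: z.string(),
  })
);
```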
breaking change: `generateStructure` and `streamStructure` redesign. The new API does not require function calling and `StructureDefinition` objects any more. This makes it more flexible, and it can be used in 3 ways:

- with OpenAI function calling:

  ```ts
  const model = openai
    .ChatTextGenerator({ model: "gpt-3.5-turbo" })
    .asFunctionCallStructureGenerationModel({
      fnName: "...",
      fnDescription: "...",
    });
  ```

- with OpenAI JSON format:

  ```ts
  const model = openai
    .ChatTextGenerator({
      model: "gpt-4-1106-preview",
      temperature: 0,
      maxCompletionTokens: 1024,
      responseFormat: { type: "json_object" },
    })
    .asStructureGenerationModel(
      jsonStructurePrompt((instruction: string, schema) => [
        OpenAIChatMessage.system(
          "JSON schema: \n" +
            JSON.stringify(schema.getJsonSchema()) +
            "\n\n" +
            "Respond only using JSON that matches the above schema."
        ),
        OpenAIChatMessage.user(instruction),
      ])
    );
  ```

- with Ollama (and a capable model, e.g., OpenHermes 2.5):

  ```ts
  const model = ollama
    .TextGenerator({
      model: "openhermes2.5-mistral",
      maxCompletionTokens: 1024,
      temperature: 0,
      format: "json",
      raw: true,
      stopSequences: ["\n\n"], // prevent infinite generation
    })
    .withPromptFormat(ChatMLPromptFormat.instruction())
    .asStructureGenerationModel(
      jsonStructurePrompt((instruction: string, schema) => ({
        system:
          "JSON schema: \n" +
          JSON.stringify(schema.getJsonSchema()) +
          "\n\n" +
          "Respond only using JSON that matches the above schema.",
        instruction,
      }))
    );
  ```

See generateStructure for details on the new API.
- breaking change: Restructured multi-modal instruction prompts and `OpenAIChatMessage.user()`.
- Multi-tool usage from open source models.

  Use `TextGenerationToolCallsOrGenerateTextModel` and the related helper method `.asToolCallsOrTextGenerationModel()` to create custom prompts & parsers.

  Examples:

  - `examples/basic/src/model-provider/ollama/ollama-use-tools-or-generate-text-openhermes-example.ts`
  - `examples/basic/src/model-provider/llamacpp/llamacpp-use-tools-or-generate-text-openhermes-example.ts`

  Example prompt format:

  - `examples/basic/src/tool/prompts/open-hermes.ts` for OpenHermes 2.5
- breaking change: Removed `FunctionListToolCallPromptFormat`. See `examples/basic/src/model-provide/ollama/ollama-use-tool-mistral-example.ts` for how to implement a `ToolCallPromptFormat` for your tool.
- breaking change: Rename `Speech` to `SpeechGenerator` in facades.
- breaking change: Rename `Transcription` to `Transcriber` in facades.
- Anthropic Claude 2.1 support
Introducing model provider facades:

```ts
const image = await generateImage(
  openai.ImageGenerator({ model: "dall-e-3", size: "1024x1024" }),
  "the wicked witch of the west in the style of early 19th century painting"
);
```

- Model provider facades. You can e.g. use `ollama.TextGenerator(...)` instead of `new OllamaTextGenerationModel(...)`.
- breaking change: Fixed method name `isParallizable` to `isParallelizable` in `EmbeddingModel`.
- breaking change: removed `HuggingFaceImageDescriptionModel`. Image description models will be replaced by multi-modal vision models.
- Increase OpenAI chat streaming resilience.
Prompt format and tool calling improvements:

- Text prompt format. Use simple text prompts, e.g. with `OpenAIChatModel`:

  ```ts
  const textStream = await streamText(
    new OpenAIChatModel({
      model: "gpt-3.5-turbo",
    }).withTextPrompt(),
    "Write a short story about a robot learning to love."
  );
  ```

- `.withTextPromptFormat` on `LlamaCppTextGenerationModel` for simplified prompt construction:

  ```ts
  const textStream = await streamText(
    new LlamaCppTextGenerationModel({
      // ...
    }).withTextPromptFormat(Llama2PromptFormat.text()),
    "Write a short story about a robot learning to love."
  );
  ```

- `.asToolCallGenerationModel()` on `OllamaTextGenerationModel` to simplify tool calls.
- Better error reporting when using exponential backoff retries.
- breaking change: removed `input` from `InstructionPrompt` (was Alpaca-specific; `AlpacaPromptFormat` still supports it).
Remove section newlines from Llama 2 prompt format.
Ollama edge case and error handling improvements.
Breaking change: the tool calling API has been reworked to support multiple parallel tool calls. This required multiple breaking changes (see below). Check out the updated tools documentation for details.

- `Tool` has `parameters` and `returnType` schemas (instead of `inputSchema` and `outputSchema`).
- `useTool` uses `generateToolCall` under the hood. The return value and error handling have changed.
- `useToolOrGenerateText` has been renamed to `useToolsOrGenerateText`. It uses `generateToolCallsOrText` under the hood. The return value and error handling have changed. It can invoke several tools in parallel and returns an array of tool results.
- The `maxRetries` parameter in `guard` has been replaced by a `maxAttempt` parameter.
- `generateStructureOrText` has been removed.
- Experimental `generateToolCallsOrText` function for generating multiple parallel tool calls using the OpenAI chat/tools API.
- ChatML prompt format.
- breaking change: `ChatPrompt` structure and terminology have changed to align more closely with OpenAI and similar chat prompts. This is also in preparation for integrating images and function call results into chat prompts.
- breaking change: Prompt formats are namespaced. Use e.g. `Llama2PromptFormat.chat()` instead of `mapChatPromptToLlama2Format()`. See Prompt Format for documentation of the new prompt formats.
- Experimental `generateToolCall` function for generating a single tool call using the OpenAI chat/tools API.
- Refactored JSON parsing to use abstracted schemas. You can use `parseJSON` and `safeParseJSON` to securely parse JSON objects and optionally type-check them using any schema (e.g. a Zod schema).
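A sketch of how these helpers are typically used; the exact call shape is an assumption (at this point, schemas were created with `new ZodSchema(...)`):

```ts
import { safeParseJSON, ZodSchema } from "modelfusion";
import { z } from "zod";

const result = safeParseJSON({
  text: '{ "name": "Nox" }',
  schema: new ZodSchema(z.object({ name: z.string() })),
});

if (result.success) {
  console.log(result.data.name); // typed access
} else {
  console.error(result.error);
}
```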
- Ollama 0.1.9 support: `format` (for forcing JSON output) and `raw` settings.
- Improved Ollama settings documentation.
- Support for fine-tuned OpenAI `gpt-4-0613` models.
- Support for the `trimWhitespace` model setting in `streamText` calls.
- Image support for `OpenAIChatMessage.user`.
- `mapInstructionPromptToBakLLaVA1ForLlamaCppFormat` prompt format.
- breaking change: `VisionInstructionPrompt` was replaced by an optional `image` field in `InstructionPrompt`.
- Support for the OpenAI vision model.
  - Example: `examples/basic/src/model-provider/openai/openai-chat-stream-text-vision-example.ts`
- Support for the OpenAI chat completion `seed` and `responseFormat` options.
- OpenAI speech generation support. Shoutout to @bjsi for the awesome contribution!
- OpenAI `gpt-3.5-turbo-1106`, `gpt-4-1106-preview`, and `gpt-4-vision-preview` chat models.
- OpenAI `Dall-E-3` image model.
- breaking change: `OpenAIImageGenerationModel` requires a `model` parameter.
- Support image input for multi-modal Llama.cpp models (e.g. Llava, Bakllava).
- breaking change: Llama.cpp prompt format has changed to support images. Use `.withTextPrompt()` to get a text prompt format.
- ElevenLabs `eleven_turbo_v2` support.
- breaking change: Uncaught errors were caused by custom Promises. ModelFusion now uses only standard Promises. To get full responses from model functions, you need to use the `{ returnType: "full" }` option instead of calling `.asFullResponse()` on the result.
- ModelFusion server error logging and reporting.
- ModelFusion server creates directory for runs automatically when errors are thrown.
- Support for Cohere v3 embeddings.
- Ollama model provider for text embeddings.
- Llama.cpp embeddings are invoked sequentially to avoid rejection by the server.
- Ollama model provider for text generation and text streaming.
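A minimal usage sketch with the class-based, positional API of this release line (the model name is illustrative):

```ts
import { generateText, OllamaTextGenerationModel } from "modelfusion";

const text = await generateText(
  new OllamaTextGenerationModel({ model: "mistral" }),
  "Write a short story about a robot learning to love:\n\n"
);
```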
Adding experimental ModelFusion server, flows, and browser utils.
- ModelFusion server (separate export 'modelfusion/server') with a Fastify plugin for running ModelFusion flows on a server.
- ModelFusion flows.
- ModelFusion browser utils (separate export 'modelfusion/browser') for dealing with audio data and invoking ModelFusion flows on the server (`invokeFlow`).
- breaking change: `readEventSource` and `readEventSourceStream` are part of 'modelfusion/browser'.
- Prompt callback option for `streamStructure`.
- Inline JSDoc comments for the model functions.
- Abort signals and errors during streaming are caught and forwarded correctly.
- `executeFunction` utility function for tracing execution time, parameters, and result of composite functions and non-ModelFusion functions.
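A sketch of the intended usage; the exact signature is an assumption:

```ts
import { executeFunction } from "modelfusion";

// traces execution time, parameters, and result like a model function:
const result = await executeFunction(
  async (input: string) => {
    // ...compose model calls and plain logic here...
    return input.toUpperCase();
  },
  "hello world",
  { functionId: "my-composite-function" }
);
```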
- Streaming results and `AsyncQueue` objects can be used by several consumers. Each consumer will receive all values. This means that you can e.g. forward the same text stream to speech generation and the client.
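For instance, the same stream can be read by two consumers, and each receives all deltas (a sketch; the model setup is illustrative):

```ts
import { openai, streamText } from "modelfusion";

const textStream = await streamText(
  openai.ChatTextGenerator({ model: "gpt-3.5-turbo" }).withTextPrompt(),
  "Write a short story about a robot learning to love:"
);

await Promise.all([
  (async () => {
    for await (const delta of textStream) {
      // consumer 1: e.g. send the delta to the client
    }
  })(),
  (async () => {
    for await (const delta of textStream) {
      // consumer 2: e.g. forward the delta to speech generation
    }
  })(),
]);
```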
ElevenLabs improvements:

- ElevenLabs model settings `outputFormat` and `optimizeStreamingLatency`.
- Default ElevenLabs model is `eleven_monolingual_v1`.
- `parentCallId` event property.
- Tracing for `useTool`, `useToolOrGenerateText`, `upsertIntoVectorIndex`, and `guard`.
- breaking change: rename `embedding` event type to `embed`.
- breaking change: rename `image-generation` event type to `generate-image`.
- breaking change: rename `speech-generation` event type to `generate-speech`.
- breaking change: rename `speech-streaming` event type to `stream-speech`.
- breaking change: rename `structure-generation` event type to `generate-structure`.
- breaking change: rename `structure-or-text-generation` event type to `generate-structure-or-text`.
- breaking change: rename `structure-streaming` event type to `stream-structure`.
- breaking change: rename `text-generation` event type to `generate-text`.
- breaking change: rename `text-streaming` event type to `stream-text`.
- breaking change: rename `transcription` event type to `generate-transcription`.
- Speech synthesis streaming supports string inputs.
- Observability for speech synthesis streaming.
- breaking change: split `synthesizeSpeech` into `generateSpeech` and `streamSpeech` functions.
- breaking change: renamed `speech-synthesis` event to `speech-generation`.
- breaking change: renamed `transcribe` to `generateTranscription`.
- breaking change: renamed `LmntSpeechSynthesisModel` to `LmntSpeechModel`.
- breaking change: renamed `ElevenLabsSpeechSynthesisModel` to `ElevenLabsSpeechModel`.
- breaking change: renamed `OpenAITextGenerationModel` to `OpenAICompletionModel`.
- breaking change: removed the `describeImage` model function. Use `generateText` instead (with e.g. `HuggingFaceImageDescriptionModel`).
- Duplex streaming for speech synthesis.
- ElevenLabs duplex streaming support.
- `Schema` uses `data` in the return type (breaking change for tools).
- Prompt formats for image generation. You can use `.withPromptFormat()` or `.withBasicPrompt()` to apply a prompt format to an image generation model.
- breaking change: `generateImage` returns a Buffer with the binary image data instead of a base-64 encoded string. You can call `.asBase64Text()` on the response to get a base64-encoded string.
- `.withChatPrompt()` and `.withInstructionPrompt()` shorthand methods.
- Updated Zod to 3.22.4. You need to use Zod 3.22.4 or higher in your project.
- Store runs in AsyncLocalStorage for convenience (Node.js only).
- Guard function.
- Anthropic model support (Claude 2, Claude instant).
breaking change: generics simplification to enable dynamic model usage. Models can be used more easily as function parameters.

- `output` renamed to `value` in `asFullResponse()`.
- Model settings can no longer be configured as a model options parameter. Use `.withSettings()` instead.
breaking change: moved Pinecone integration into the `@modelfusion/pinecone` module.
- `readEventSource` for parsing a server-sent event stream using the JavaScript EventSource.
breaking change: generalization to use `Schema` instead of Zod.

- `MemoryVectorIndex.deserialize` requires a `Schema`, e.g. `new ZodSchema` (from ModelFusion).
- `readEventSourceStream` requires a `Schema`.
- `UncheckedJsonSchema[Schema/StructureDefinition]` renamed to `Unchecked[Schema/StructureDefinition]`.
breaking change: Generalized embeddings beyond text embedding.

- `embedText` renamed to `embed`.
- `embedTexts` renamed to `embedMany`.
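After the rename, both functions take a model and one value or many values (a sketch using the class-based, positional API of this release line):

```ts
import { embed, embedMany, OpenAITextEmbeddingModel } from "modelfusion";

const model = new OpenAITextEmbeddingModel({ model: "text-embedding-ada-002" });

// one value:
const embedding = await embed(
  model,
  "At first, Nox didn't know what to do with the pup."
);

// many values:
const embeddings = await embedMany(model, ["a first text", "a second text"]);
```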
- Removed filtering from `VectorIndexRetriever` query (still available as a setting).
- `VectorIndexRetriever` supports a filter option that is passed to the vector index.
- `MemoryVectorIndex` supports filter functions that are applied to the objects before calculating the embeddings.
- `basic-text` logger logs function ids when available.
- `retrieve` produces events for logging and observability.
- Support empty stop sequences when calling OpenAI text and chat models.
- Fixed bugs in `streamStructure` partial JSON parsing.
- `streamStructure` for streaming structured responses, e.g. from OpenAI function calls. Thanks @bjsi for the input!
- First version of event source utilities: `AsyncQueue`, `createEventSourceStream`, `readEventSourceStream`.
- Remove resolution part from type definitions.
breaking change: Generalized vector store upsert/retrieve beyond text chunks:

- `upsertTextChunks` renamed to `upsertIntoVectorIndex`. Syntax has changed.
- `retrieveTextChunks` renamed to `retrieve`.
- `SimilarTextChunksFromVectorIndexRetriever` renamed to `VectorIndexRetriever`.
- OpenAI gpt-3.5-turbo-instruct model support.
- Autocomplete for Stability AI models (thanks @Danielwinkelmann!)
- Downgrade Zod version to 3.21.4 because of colinhacks/zod#2697
- breaking change: Renamed chat format construction functions to follow the pattern `map[Chat|Instruction]PromptTo[FORMAT]Format()`, e.g. `mapInstructionPromptToAlpacaFormat()`, for easy auto-completion.
- breaking change: The prompts for `generateStructure` and `generateStructureOrText` have been simplified. You can remove the `OpenAIChatPrompt.forStructureCurried` (and similar) parts.
- You can directly pass JSON schemas into `generateStructure` and `generateStructureOrText` calls without validation using `UncheckedJsonSchemaStructureDefinition`. This is useful when you need more flexibility and don't require type inference. See `examples/basic/src/util/schema/generate-structure-unchecked-json-schema-example.ts`.
- BREAKING CHANGE: renamed `generateJson` and `generateJsonOrText` to `generateStructure` and `generateStructureOrText`.
- BREAKING CHANGE: introduced `ZodSchema` and `ZodStructureDefinition`. These are required for `generateStructure` and `generateStructureOrText` calls and in tools.
- BREAKING CHANGE: renamed the corresponding methods and objects.
Why this breaking change?
ModelFusion is currently tied to Zod, but there are many other type checking libraries out there, and Zod does not map perfectly to JSON Schema (which is used in OpenAI function calling). Enabling you to use JSON Schema directly in ModelFusion is a first step towards decoupling ModelFusion from Zod. You can also configure your own schema adapters that e.g. use Ajv or another library. Since this change already affected all JSON generation calls and tools, I included other changes that I had planned in the same area (e.g., renaming to generateStructure and making it more consistent).
- `describeImage` model function for image captioning and OCR. HuggingFace provider available.
- `BaseUrlApiConfiguration` class for setting up API configurations with custom base URLs and headers.
- Support for running OpenAI on Microsoft Azure.
- Breaking change: Introduce API configuration. This affects setting the baseUrl, throttling, and retries.
- Improved Helicone support via `HeliconeOpenAIApiConfiguration`.
- LMNT speech synthesis support.
- Separated cost calculation from Run.
- Exposed the `logitBias` setting for OpenAI chat and text generation models.
- Support for fine-tuned OpenAI models (for the `davinci-002`, `babbage-002`, and `gpt-3.5-turbo` base models).
- Function logging support.
- Usage information for events.
- Filtering of model settings for events.
- Breaking change: Restructured the function call events.
- Breaking change: Reworked the function observer system. See Function observers for details on how to use the new system.
- Breaking change: Use `.asFullResponse()` to get full responses from model functions (replaces the `fullResponse: true` option).
- Support for "babbage-002" and "davinci-002" OpenAI base models.
- Choose correct tokenizer for older OpenAI text models.
- Support for ElevenLabs speech synthesis parameters.
- `generateSpeech` function to generate speech from text.
- ElevenLabs support.
- Introduced unified `stopSequences` and `maxCompletionTokens` properties for all text generation models. Breaking change: `maxCompletionTokens` and `stopSequences` are part of the base `TextGenerationModel`. Model-specific names for these properties have been replaced by the unified ones, e.g. `maxTokens` in OpenAI models is now `maxCompletionTokens`.
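A sketch of the unified settings (the model name is illustrative):

```ts
import { OpenAITextGenerationModel } from "modelfusion";

const model = new OpenAITextGenerationModel({
  model: "davinci-002",
  maxCompletionTokens: 500, // previously `maxTokens` on OpenAI models
  stopSequences: ["\n\n"],
});
```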
- Breaking change: Renamed prompt mappings (and related code) to prompt format.
- Improved type inference for WebSearchTool and executeTool.
- `JsonTextGenerationModel` and `InstructionWithSchemaPrompt` to support `generateJson` on text generation models.
- WebSearchTool signature updated.
- Convenience functions to create OpenAI chat messages from tool calls and results.
- `WebSearchTool` definition to support the SerpAPI tool (separate package: `@modelfusion/serpapi-tools`).
- `executeTool` function that directly executes a single tool and records execution metadata.
- Reworked event system and introduced RunFunctionEvent.
- Breaking change: Model functions return a simple object by default to make the 95% use case easier. You can use the `fullResponse` option to get a richer response object that includes the original model response and metadata.
- `splitTextChunk` function.
- Breaking change: Restructured text splitter functions.
- `splitTextChunks` function.
- Chat with PDF demo.
- Breaking change: Renamed `VectorIndexSimilarTextChunkRetriever` to `SimilarTextChunksFromVectorIndexRetriever`.
- Breaking change: Renamed the `content` property in `TextChunk` to `text`.
- `VectorIndexTextChunkStore`.
- Fixed a type inference bug in `trimChatPrompt`.
- HuggingFace text embedding support.
- Helicone observability integration.
- Instruction prompts can contain an optional `input` property.
- Alpaca instruction prompt mapping.
- Vicuna chat prompt mapping.
- Docs updated to ModelFusion.
- Breaking Change: Renamed to `modelfusion` (from `ai-utils.js`).
- Breaking Change: model functions return rich objects that include the result, the model response and metadata. This enables you to access the original model response easily when you need it and also use the metadata outside of runs.
- `trimChatPrompt()` function to fit chat prompts into the context window and leave enough space for the completion.
- `maxCompletionTokens` property on TextGenerationModels.
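A usage sketch for `trimChatPrompt` (imports omitted; the named-parameter call shape is an assumption):

```ts
// drops older messages until the prompt fits the model's context window,
// leaving room for `maxCompletionTokens`:
const trimmedPrompt = await trimChatPrompt({
  prompt: chatPrompt, // a chat prompt with system message and messages
  model, // a text generation model with a tokenizer and context window size
});
```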
- Renamed `withMaxTokens` to `withMaxCompletionTokens` on TextGenerationModels.
- Removed `composeRecentMessagesOpenAIChatPrompt` function (use `trimChatPrompt` instead).
- ChatPrompt concept (with chat prompt mappings for text, OpenAI chat, and Llama 2 prompts).
- Renamed prompt mappings and changed into functions.
- Prompt mapping support for text generation and streaming.
- Added instruction prompt concept and mapping.
- Option to specify context window size for Llama.cpp text generation models.
- Renamed 'maxTokens' to 'contextWindowSize' where applicable.
- Restructured how tokenizers are exposed by text generation models.
- llama.cpp embedding support.
- `zod` and `zod-to-json-schema` are peer dependencies and no longer included in the package.
- `generateJsonOrText`, `useToolOrGenerateText`, and `useTool` return additional information in the response (e.g. the parameters and additional text).
- Renamed `callTool` to `useTool` and `callToolOrGenerateText` to `useToolOrGenerateText`.
- `generateJsonOrText`
- Tools: `Tool` class, `callTool`, `callToolOrGenerateText`
- Restructured `generateJson` arguments.
- Removed `asFunction` model function variants. Use JavaScript lambda functions instead.
- OpenAIChatAutoFunctionPrompt to call the OpenAI functions API with multiple functions in 'auto' mode.
- Changed the prompt format of the generateJson function.
- Reworked interaction with vectors stores. Removed VectorDB, renamed VectorStore to VectorIndex, and introduced upsertTextChunks and retrieveTextChunks functions.
- Bugs related to `performance.now` not being available.
- Llama.cpp tokenization support.
- Split Tokenizer API into BasicTokenizer and FullTokenizer.
- Introduce `countTokens` function (replacing `Tokenizer.countTokens`).
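A usage sketch (imports omitted; the tokenizer construction is illustrative):

```ts
// count tokens with a tokenizer instance:
const tokenizer = new TikTokenTokenizer({ model: "gpt-4" });

const tokenCount = await countTokens(
  tokenizer,
  "At first, Nox didn't know what to do with the pup."
);
```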
- Events for streamText.
- TextDeltaEventSource for Client/Server streaming support.
- End-of-stream bug in Llama.cpp text streaming.
- Streaming support for Cohere text generation models.
- Streaming support for OpenAI text completion models.
- OpenAI function streaming support (in low-level API).
- Generalized text streaming (async string iterable, useful for command line streaming).
- Streaming support for Llama.cpp text generation.
- Llama.cpp text generation support.
- Convert all main methods (e.g. `model.generateText(...)`) to a functional API (i.e., `generateText(model, ...)`).
- JSON generation model.
- Automatic1111 image generation provider.
- Cost calculation for OpenAI image generation and transcription models.
- Cost calculation for Open AI text generation, chat and embedding models.
- Renamed RunContext to Run. Introduced DefaultRun.
- Changed events and observers.
- Updated OpenAI models.
- Low-level support for the OpenAI chat functions API (via `OpenAIChatModel.callApi`).
- `TranscriptionModel` and `OpenAITranscriptionModel` (using `whisper`).
- Single optional parameter for functions/method that contains run, functionId, etc.
- Retry is not attempted when you ran out of OpenAI credits.
- Vercel edge function support (switched to nanoid for unique IDs).
- Improved OpenAI chat streaming API.
- Changed `asFunction` variants from namespaced functions into stand-alone functions.
- Documentation update.
- Major rework of embedding APIs.
- Major rework of text and image generation APIs.
- Various renames.
- Pinecone VectorDB support
- Cohere tokenization support
- OpenAI DALL-E image generation support
- `generateImage` function
- Throttling and retries on model level
- Stability AI image generation support
- Image generation Next.js example
- Updated PDF to tweet example with style transfer
- Hugging Face text generation support
- Memory vector DB
- Cohere embedding API support
- Restructured retry logic
- `embed` embeds many texts at once
- Cohere text generation support
- OpenAI chat streams can be returned as delta async iterables
- Documentation of integration APIs and models
- OpenAI embedding support
- Text embedding functions
- Chat streams can be returned as ReadableStream or AsyncIterable
- Basic examples under `examples/basic`
- Initial documentation available at modelfusion.dev
- Voice recording and transcription Next.js app example.
- OpenAI transcription support (Whisper).
- BabyAGI Example in TypeScript
- TikToken for OpenAI: We've added tiktoken to aid in tokenization and token counting, including those for message and prompt overhead tokens in chat.
- Tokenization-based Recursive Splitter: A new splitter that operates recursively using tokenization.
- Prompt Management Utility: An enhancement to fit recent chat messages into the context window.
- AI Chat Example using Next.js: An example demonstrating AI chat implementation using Next.js.
- PDF to Twitter Thread Example: This shows how a PDF can be converted into a Twitter thread.
- OpenAI Chat Completion Streaming Support: A feature providing real-time response capabilities using OpenAI's chat completion streaming.
- OpenAI Chat and Text Completion Support: This addition enables the software to handle both chat and text completions from OpenAI.
- Retry Management: A feature to enhance resilience by managing retry attempts for tasks.
- Task Progress Reporting and Abort Signals: This allows users to track the progress of tasks and gives the ability to abort tasks when needed.
- Recursive Character Splitter: A feature to split text into characters recursively for more detailed text analysis.
- Recursive Text Mapping: This enables recursive mapping of text, beneficial for tasks like summarization or extraction.
- Split-Map-Filter-Reduce for Text Processing: A process chain developed for sophisticated text handling, allowing operations to split, map, filter, and reduce text data.