
node-llama-cpp
Run AI models locally on your machine with Node.js bindings for llama.cpp. Enforces a JSON schema on model output at the generation level.
@basetenlabs/performance-client
A high-performance Node.js client for Baseten.co endpoints, including embeddings, reranking, and classification. Built for massive concurrent POST requests to any URL, including endpoints outside of baseten.co.
n8n-nodes-query-retriever-rerank
Advanced n8n community node for intelligent document retrieval with multi-step reasoning, reranking, and comprehensive debugging
n8n-nodes-contextualai
n8n community node for Contextual AI - enterprise-grade RAG agents, document parsing, querying, reranking, and evaluation
llama-cpp-capacitor
A native Capacitor plugin that embeds llama.cpp directly into mobile apps, enabling offline AI inference with a chat-first API design. Supports simple text generation as well as advanced chat conversations with system prompts, multimodal processing, TTS, and LoRA adapters.