AgentCC is a discovery hub for AI agent capabilities. We index Agent Skills as the primary dataset, and also organize MCP servers and selected AI tools so developers can quickly find, compare, and evaluate what to integrate into their workflows.

What is an Agent Skill?

An Agent Skill is a reusable capability package for an AI agent. It can define workflows, tool usage patterns, domain knowledge, or execution rules that help the agent perform more reliably in a specific task or environment.

How do I use a skill listed on AgentCC?

AgentCC is a resource and discovery layer, not a one-click installer. On each skill page, you should review the repository, file tree, install command, and usage notes, then integrate it according to the runtime or client you are using.

Does AgentCC host or execute skill code for me?

No. AgentCC primarily indexes external repositories and metadata. We help you understand what a skill is, where it comes from, and how it may be used, but execution and security decisions still belong to your own runtime environment.

Are all listed skills safe?

No directory can guarantee absolute safety. AgentCC can help surface repository links, file structures, and metadata, but you should still verify permissions, external dependencies, API usage, and code quality before using a skill in production or on sensitive machines.

Can I submit my own skill, MCP server, or tool?

Yes. AgentCC is designed to be an evolving resource graph. As the submission and curation workflow matures, contributors will be able to recommend high-quality skills, MCP servers, and AI tools into the directory.

主站 Developer Toolstensorrt-llm

tensorrt-llm

Name: tensorrt-llm
Author: davila7

Optimizes LLM inference with NVIDIA TensorRT for maximum throughput and lowest latency. Use for production deployment on NVIDIA GPUs (A100/H100), when you need 10-100x faster inference than PyTorch, or for serving models with quantization (FP8/INT4), in-flight batching, and multi-GPU scaling.

21.8k Stars0 Installs更新于 95 days agoMIT

Inference Serving TensorRT-LLM NVIDIA Inference Optimization High Throughput Low Latency Production FP8 INT4 In-Flight Batching Multi-GPU

About tensorrt-llm

通过NVIDIA TensorRT优化LLM推理，以实现最大吞吐量和最低延迟。在需要比PyTorch快10-100倍的推理时，或用于具有量化（FP8/INT4）、飞行批处理和多GPU扩展的模型服务时，使用NVIDIA GPUs（A100/H100）进行生产部署。

Category: developer (开发工具) · Author: davila7 · Version: @main · License: MIT

Tags: Inference Serving, TensorRT-LLM, NVIDIA, Inference Optimization, High Throughput, Low Latency, Production, FP8, INT4, In-Flight Batching, Multi-GPU

该 Skill 暂无文档文件。

安装指令

npx skills add davila7/tensorrt-llm

下载解压包

下载 skill.zip

下载完整 Skill 目录，包含 SKILL.md 及所有相关文件

信息

Authordavila7

Categorydeveloper

Version@main

Last Updated95 days ago

LicenseMIT

在 GitHub 中查看

About tensorrt-llm

tensorrt-llm 是由 davila7 开发的 AI Agent 技能，属于「developer」分类。 Optimizes LLM inference with NVIDIA TensorRT for maximum throughput and lowest latency. Use for production deployment on NVIDIA GPUs (A100/H100), when you need 10-100x faster inference than PyTorch, or for serving models with quantization (FP8/INT4), in-flight batching, and multi-GPU scaling. 该技能支持 Inference Serving、TensorRT-LLM、NVIDIA、Inference Optimization、High Throughput、Low Latency、Production、FP8、INT4、In-Flight Batching、Multi-GPU 相关能力，可直接集成到兼容的 AI Agent 平台中使用。安装后，Agent 将获得该技能定义的工具、提示词或工作流，从而在对话中自动调用相应功能。

Usage Scenarios

将 tensorrt-llm 安装到你的 AI Agent 后，可以在日常对话中自动触发相关能力，无需手动切换工具。
作为 developer 类技能，它可以扩展 Agent 在该领域的能力边界，让 Agent 处理更复杂的任务。

Frequently Asked Questions

什么是 tensorrt-llm？

tensorrt-llm 是一个 AI Agent 技能，由 davila7 开发，归类于「developer」。安装后，它会为你的 Agent 增加新的能力，让 Agent 能够执行更丰富的任务。

如何安装这个技能？

点击页面右侧的安装命令复制到终端执行即可。大多数技能使用 npx skills add 命令安装，部分技能也支持手动下载 ZIP 文件。

这个技能是免费的吗？

该技能在 AgentCC 上免费提供。但部分技能可能依赖第三方 API 或服务，使用时请查看技能文档了解是否需要额外的 API Key 或付费服务。

安装后如何使用？

安装成功后，技能会自动注册到你的 Agent 平台。在与 Agent 对话时，当你的需求匹配该技能的能力范围，Agent 会自动调用该技能完成任务。

这个技能和其他同类技能有什么区别？

每个技能的实现方式、覆盖范围和作者不同。建议对比页面底部的「相关技能推荐」中的同类选项，选择最符合你需求的技能。

Related Skills

local-places

Search for places (restaurants, cafes, etc.) via Google Places API proxy on localhost.

246,840·openclaw

github

Interact with GitHub using the `gh` CLI. Use `gh issue`, `gh pr`, `gh run`, and `gh api` for issues, PRs, CI runs, and advanced queries.

246,840·openclaw

skill-creator

Create or update AgentSkills. Use when designing, structuring, or packaging skills with scripts, references, and assets.

246,840·openclaw

voice-call

Start voice calls via the OpenClaw voice-call plugin.

246,840·openclaw

notion

Notion API for creating and managing pages, databases, and blocks.

246,840·openclaw

gemini

Gemini CLI for one-shot Q&A, summaries, and generation.

246,840·openclaw

Category:developer

Tags:Inference Serving, TensorRT-LLM, NVIDIA, Inference Optimization, High Throughput, Low Latency, Production, FP8, INT4, In-Flight Batching, Multi-GPU

onlinev0.1.0

AgentCC / The Agent Context Center

主站 Developer Toolstensorrt-llm

tensorrt-llm

21.8k Stars0 Installs更新于 95 days agoMIT

Inference Serving TensorRT-LLM NVIDIA Inference Optimization High Throughput Low Latency Production FP8 INT4 In-Flight Batching Multi-GPU

About tensorrt-llm

Category: developer (开发工具) · Author: davila7 · Version: @main · License: MIT

Tags: Inference Serving, TensorRT-LLM, NVIDIA, Inference Optimization, High Throughput, Low Latency, Production, FP8, INT4, In-Flight Batching, Multi-GPU

该 Skill 暂无文档文件。

安装指令