Explore tweets tagged as #TensorRTLLM
@ManuAGI01
ManuAGI 🤖 - ( ManuIn )
19 days
🚀TensorRTLLM🔥. "TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes…"
Replies: 1 · Reposts: 0 · Likes: 0
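The README excerpt quoted above mentions a Python API for defining LLMs and running inference. As a rough illustration only, here is a minimal sketch of what the high-level API looks like in recent TensorRT-LLM releases; the model name is just an example and exact class and parameter names may differ between versions.

```python
# Minimal sketch of the TensorRT-LLM high-level Python API (names may vary by release).
from tensorrt_llm import LLM, SamplingParams

# Example model; any supported Hugging Face checkpoint or prebuilt engine path works.
llm = LLM(model="TinyLlama/TinyLlama-1.1B-Chat-v1.0")

params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)
outputs = llm.generate(["Hello, my name is"], params)

for out in outputs:
    print(out.outputs[0].text)  # first completion for each prompt
```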
@ManuAGI01
ManuAGI 🤖 - ( ManuIn )
4 months
🚀Project Number 5 - TensorRT-LLM🔥. 'Turbocharging Your Large Language Model Inference!' #opensource #github #trending #AI #coding #search #softwaredevelopment #technology #innovation #programming #AItools #TensorRTLLM
Replies: 1 · Reposts: 0 · Likes: 0
@moficodes
Mofi Rahman
1 year
3. TensorRTLLM with GPUs -
Replies: 1 · Reposts: 1 · Likes: 3
@0xblacklight
Kyle Mistele 🏴‍☠️
1 year
Time to rip out vllm for triton+tensorrtllm immediately after ripping out llama.cpp for vllm
Replies: 0 · Reposts: 1 · Likes: 1
@Synapsi_Off
Synapsi Community
1 year
And because everything runs locally on your Windows RTX PC or workstation, you get fast, secure results. #ChatRTX #GPT #RAG #TensorRTLLM #RTX #Personalizzazione #Chatbot.
Replies: 0 · Reposts: 0 · Likes: 0
@Aratako_LM
Aratako
7 months
This vLLM vs TensorRT-LLM series was really good. It starts with clear explanations of the various aspects of LLM inference and then moves on to performance comparisons, so I think it is instructive even for people with no particular interest in vLLM or TensorRT-LLM themselves. [vLLM vs TensorRT-LLM] #1. An Overall Evaluation - SqueezeBits
Replies: 1 · Reposts: 14 · Likes: 101
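Comparisons like the SqueezeBits series above ultimately reduce to measuring generated tokens per second under a given load. A minimal, backend-agnostic sketch of that measurement; the `generate` and `count_tokens` callables are placeholders for whichever engine (vLLM, TensorRT-LLM, or otherwise) is being benchmarked.

```python
import time
from typing import Callable, List

def measure_throughput(
    generate: Callable[[List[str]], List[str]],   # backend call: prompts -> completions
    count_tokens: Callable[[str], int],           # tokenizer-based token counter
    prompts: List[str],
) -> float:
    """Return generated tokens per second for one batch of prompts."""
    start = time.perf_counter()
    completions = generate(prompts)
    elapsed = time.perf_counter() - start
    generated = sum(count_tokens(text) for text in completions)
    return generated / elapsed
```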
@akashians_
Akashians
1 year
@NVIDIAAIDev pushing @akashnet_ $akt #akt. Check it out ⬇️⬇️. Optimize your Meta Llama 3 #inference with TensorRT-LLM. …and how! ✨ in only 4 steps ✨. 📗➡️. @AIatMeta
Replies: 0 · Reposts: 6 · Likes: 13
@GenAiAlien
Kain Jares - The GenAI Alien
7 months
@Apple working with @nvidia to improve the speed of #TensorRTLLM by almost a factor of 3x was not on my bingo card for 2024.
Replies: 0 · Reposts: 0 · Likes: 0
@NVIDIAAIDev
NVIDIA AI Developer
1 year
🦙🦙🦙 Optimize your Meta Llama 3 #inference with TensorRT-LLM. …and how! ✨ in only 4 steps ✨. 📗➡️. @AIatMeta
Replies: 4 · Reposts: 23 · Likes: 102
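The "4 steps" in the NVIDIA post above correspond to the usual TensorRT-LLM offline workflow: get the code, convert the Hugging Face checkpoint, build an engine, then run it. A hedged sketch of those steps driven from Python follows; the paths are illustrative, and the exact example-script locations and flags vary between TensorRT-LLM releases.

```python
import subprocess

# Hypothetical local paths; adjust to your environment.
HF_MODEL = "Meta-Llama-3-8B-Instruct"   # downloaded Hugging Face checkpoint
CKPT_DIR = "llama3_trtllm_ckpt"         # converted TensorRT-LLM checkpoint
ENGINE_DIR = "llama3_trtllm_engine"     # compiled engine

# 1) Get TensorRT-LLM (the repo carries the example conversion/run scripts).
subprocess.run(["git", "clone", "https://github.com/NVIDIA/TensorRT-LLM.git"], check=True)

# 2) Convert the Hugging Face weights into TensorRT-LLM's checkpoint format.
subprocess.run(
    ["python", "TensorRT-LLM/examples/llama/convert_checkpoint.py",
     "--model_dir", HF_MODEL, "--output_dir", CKPT_DIR, "--dtype", "float16"],
    check=True,
)

# 3) Compile the checkpoint into an optimized engine for the local GPU.
subprocess.run(["trtllm-build", "--checkpoint_dir", CKPT_DIR, "--output_dir", ENGINE_DIR], check=True)

# 4) Run inference against the built engine.
subprocess.run(
    ["python", "TensorRT-LLM/examples/run.py",
     "--engine_dir", ENGINE_DIR, "--tokenizer_dir", HF_MODEL,
     "--input_text", "What is TensorRT-LLM?", "--max_output_len", "64"],
    check=True,
)
```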
@dpinto64
David Pinto
1 year
Optimize your Meta Llama 3 #inference with TensorRT-LLM in only 4 steps. @AIatMeta @nvidiaaidev
Replies: 0 · Reposts: 0 · Likes: 1
@andrey_cheptsov
Andrey Cheptsov
6 months
Benchmarking @vllm_project vs @nvidia TensorRT-LLM (VLMs) by the SqueezeBits team
Replies: 0 · Reposts: 1 · Likes: 7
@KMatiDev1
Mati
2 years
I ran a benchmark with int4 and achieved an inference speed of about 100 tokens per second. 🚀 Here is the script: 👇👇👇 #buildinpublic #indiehackers #AI #TensorRTLLM.
Replies: 0 · Reposts: 0 · Likes: 3
@jessebenisrael
Jesse Ben Israel
1 year
Exciting news: Gemma 7B has been deployed with TensorRT-LLM, achieving over 500 tokens per second, a significant advancement in language processing! #Gemma7B #TensorRTLLM.
Replies: 0 · Reposts: 0 · Likes: 1
@genainewstop
GenAINews.co
8 months
Accelerate time to first token with NVIDIA TensorRT-LLM KV cache early reuse techniques! Learn how to optimize KV cache for faster response times. #TensorRTLLM #KVCacheReuse #NVIDIA #AI #Efficiency
Replies: 0 · Reposts: 0 · Likes: 0
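KV cache early reuse lets new requests share cached key/value blocks produced by earlier prompts (for example, a common system prompt), which cuts time to first token. A hedged sketch of turning block reuse on through the high-level API; `KvCacheConfig` and its `enable_block_reuse` flag are the names used in recent releases, but check the docs for your version.

```python
from tensorrt_llm import LLM, SamplingParams
from tensorrt_llm.llmapi import KvCacheConfig

# Enable KV cache block reuse so requests that share a prefix (e.g. the same
# system prompt) can skip recomputing those key/value blocks.
llm = LLM(
    model="TinyLlama/TinyLlama-1.1B-Chat-v1.0",   # example model
    kv_cache_config=KvCacheConfig(enable_block_reuse=True),
)

system = "You are a concise assistant. "
outputs = llm.generate(
    [system + "Summarize KV cache reuse.", system + "What is time to first token?"],
    SamplingParams(max_tokens=64),
)
for out in outputs:
    print(out.outputs[0].text)
```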
@JigarHalani3
Jigar Halani
1 year
Optimize your Meta Llama 3 #inference with TensorRT-LLM in only 4 steps. @AIatMeta @nvidiaaidev
Replies: 0 · Reposts: 0 · Likes: 1
@NVIDIAAP
NVIDIA Asia Pacific
2 years
NVIDIA has just released TensorRT-LLM, a game-changer for optimizing large language models in AI inference. Don't worry, you can catch up by reading our technical blog. Dive into the details now! #NVIDIA #TensorRTLLM #AIinference
Replies: 0 · Reposts: 0 · Likes: 1
@aioaipl
AI o AI
2 years
Speed meets efficiency! 🚀 NVIDIA TensorRT-LLM, a powerful optimization tool, is now available for free! Don't miss the chance to accelerate your #AI projects. Check out the free library. #NVIDIA #TensorRTLLM #OptymalizacjaAI #Technologia.
Replies: 0 · Reposts: 1 · Likes: 1
@nauzetjesus_
nauzetjesus.com
1 year
Chat with RTX: your personal AI chatbot on your PC. #ChatConRTX #NVIDIA #IAGenerativa #Chatbot #LLM #GPT #RAG #TensorRTLLM #AceleraciónRTX #GPU.
Replies: 0 · Reposts: 1 · Likes: 0
@sagar_desai_x
sagar desai
1 year
Optimize your Meta Llama 3 #inference with TensorRT-LLM in only 4 steps. @AIatMeta @nvidiaaidev
Replies: 0 · Reposts: 0 · Likes: 0