Vilson RodriguesA Friendly Introduction to TensorRT: Building EnginesLearn to export models to an efficient model format5 min read·5 days ago----
Vilson RodriguesSparse, Quantize and Serving LLMs with NeuralMagic, AutoGPTQ and vLLMA guide to explore Sparse techniques to compress LLMs5 min read·Apr 9, 2024----
Vilson RodriguesAdd Non Maximum Suppression (NMS) to object detection model using ONNXIntegrate NMS node to your ONNX model6 min read·Sep 12, 2023----
Vilson RodriguesBuild a image preprocessing model using Pytorch and integrate into your model using ONNXReduce your project’s dependencies with ONNX3 min read·Sep 6, 2023----
Vilson RodriguesRun LLAMA-2 models in a Colab instance using GGML and CTransformersTry new META AI models in free enviroments5 min read·Jul 18, 2023--3--3
Vilson RodriguesServing Falcon models with 🤗 Text Generation Inference (TGI)Run your LLM eficiently with TGI and LangChain integration 5 min read·Jun 11, 2023--1--1
Vilson RodriguesRun your private LLM: Falcon-7B-Instruct with less than 6GB of GPU using 4-bit quantizationBuilding with BitsAndBytes, HuggingFace and LangChain5 min read·Jun 9, 2023--4--4
Vilson Rodrigues🤖ChatTube🎥: Chat with Youtube VideoBuilding a Retrieval Question Answering System to YouTube videos with LangChain, OpenAI and FAISS7 min read·Jun 6, 2023----
Vilson RodriguesUma análise de tweets com as #TheBatman e #BatmanTrabalho referente ao requisito da disciplina Network Analysis ministrada pelo professor Ivanovitch Silva. Feito em grupo com o Pedro…4 min read·Feb 18, 2022----
Vilson RodriguesNetwork Analysis em playlists do SpotifyEsse trabalho é referente a disciplina Network Analysis — UFRN.9 min read·Jan 19, 2022----