Explore tweets tagged as #Deduplicate
Webhooks that just work with @restatedev ✅ Event persistance - let Restate acknowledge and persist the event ✅ Reliable processing - leverage Durable Execution to run until completion ✅ Idempotency - use Idempotency-Key to deduplicate Check it out! https://t.co/uhr0zU4ojV
0
2
2
Today, we are open-sourcing our pipeline to deduplicate large-scale image datasets. On one GPU, we can deduplicate 10k images against 1M indexed test images in ~60 seconds. But how?
17
103
813
Learning about gossip protocol for my multi node communication blog. So it’s just broadcast and deduplicate
0
0
2
Drowning in POA&M chaos? 🌀 AI can turn that mess into order - deduplicate findings, rank risks, and wow auditors with clean reports. Streamline compliance with automation that actually works. Read more 👉 https://t.co/tIccTdaU8n
#DIB #Cybersecurity #FedRAMP #CMMC #AI #Quzara
0
0
0
Laravel Tip💡: Deduplicate Characters Laravel v11.20 introduces a new "deduplicate" method which allows you to remove duplicates from spaces or any character you choose 🚀 #laravel
2
8
67
how I monitor twitter for ai-related posts now. collect tweets every 90 minutes. deduplicate analyze based on prompt 1 (batches of 200 tweets) repeat x3 read the 10% valuable tweets and ignore 90% garbage.
1
0
2
Eagle 2: new Nvidia VLM with focus on post-training. They prioritize data strategy: "diversity first, then quality." They collect data from Arxiv, HuggingFace, deduplicate, filter low quality, augment by VLM rephrasing, etc. Final model is competitive with Qwen2-VL.
2
1
5
As of Payload 3.24.0, you can now use block references to deduplicate blocks, resulting in up to 3× faster page loads! Tested on a project with 600 blocks across 40 block fields (that’s 24,000 referenced blocks), it’s a massive performance boost for large-scale setups!
2
2
57
Wrote part two of the multi node broadcasting challenge. Wrote a blog about the brute force and how to deduplicate and send messages in a cluster of nodes
1
0
0
Use this prompt to efficiently organize, clean up, and prepare your GitHub issues: ------------------------------------- "We currently have open issues. Your job is to: 1. Deduplicate: Identify and merge any duplicate issues. 2. Titles: Make each issue title concise, clear, and
2
0
6
Data Leakage and deduplication are critical when training LLMs! SemHash is a new, blazingly fast semantic text deduplication library that combines Model2Vec embeddings with ANN-based similarity search through Vicinity, making it possible to deduplicate millions of records in
6
53
275
This is how you deduplicate a list in Python. You use the built-in `set` if you only care about the unique values. If the order is important, you use `dict.fromkeys`. These 2 tips only work with hashable values... But the values can be in a list or any other iterable.
2
4
47
using vector search to deduplicate knowledge graph vertex creation
0
0
1
process, filter, and deduplicate large-scale text data with customizable pipelines
1
0
4
Laravel Tip 🚀 Today's tip is perhaps a hidden gem that not many people know about. Have you ever wanted to remove duplicated characters from a string and had to create some complex logic or regex for doing that? Laravel has a string helper called deduplicate that helps you
4
11
42
always, and i mean ALWAYS, even if you have reasonable reason to believe that all samples are unique, deduplicate your datasets
14
5
181
0
0
0