Explore tweets tagged as #Dataset_JSON
@MatternJustus
Justus Mattern
2 months
another easy to generate verifiable task: ask an LLM to generate a JSON that adheres to a very complex (LLM-generated) pydantic model. Coming soon to the most diverse RL dataset out there, along with many other tasks that are not math and coding
Tweet media one
7
17
184
@Arya_150
Arya🍊,💊
2 months
Sahara AI - Public Testnet SIWA. Faucet Testnet Click Register AI Assets.Connect new Wallet.Create Profile.Input Your dataset ( Upload a .json, .csv, or .txt ).Register Dataset & Minting.Visit Vault to View Your Dataset
0
0
0
@codepo8
Chris Heilmann codepo8.bsky.social
5 months
Generating 19,542 static HTML pages from a JSON dataset and a template.html in PHP in <10 seconds. It is bonkers how fast hard-drive access has become.
1
1
10
@victormustar
Victor M
1 year
🎉 Just shipped! Big update of the Hugging Face Datasets page. 📊 With new powerful filtering options:. 1. By Modalities (🖼️ Image, 🔊 Audio, 📝 Text, . ).2. By Dataset Size (from 1k to ∞ samples).3. By Format (JSON, CSV, Parquet, . ). Should be easier to find the perfect
Tweet media one
2
15
95
@SaharaLabsAI
Sahara AI 🔆
2 months
6/. 🔐Getting Started:. Registering and tokenizing datasets on SIWA is easy and takes seconds!. 1️⃣ Upload a .json, .csv, or .txt dataset using our guided Developer Portal. 2️⃣ Tokenize the dataset by minting your ERC-721 Ownership NFT. This will automatically log the dataset into
Tweet media one
5
21
111
@peakaustria
Thomas Reis
1 year
Daily Sea Surface Temperature Sunday March 10, 2024 21,22 ° C . 21.21,21.21,21.21,21.22 is the JSON dataset. Look around is anyone building Carbon Storing Straw or Hemp houses? Like some late prominent climate scientist suppose? Or massive weathering? And a global Job and Debt
Tweet media one
36
76
284
@Teknium1
Teknium (e/λ)
2 months
Another RL environment added to Atropos!. @MatternJustus released a pydantic schemas dataset that can be used to ask the model to create valid structured outputs of those objects - so I made an environment that asks the model to create JSON, YAML, TOML, etc and validate against
Tweet media one
@MatternJustus
Justus Mattern
2 months
another easy to generate verifiable task: ask an LLM to generate a JSON that adheres to a very complex (LLM-generated) pydantic model. Coming soon to the most diverse RL dataset out there, along with many other tasks that are not math and coding
Tweet media one
5
8
94
@intrstllrninja
interstellarninja
2 months
just created atropos rl env for structured outputs. could you guys recommend me some high quality json mode dataset on huggingface to use?
Tweet media one
1
0
1
@bricks_global
Life Bricks Global
5 months
An example of our annotated conversational AI training dataset in JSON. This particular section indicates how a portion of the data is delivered to clients. Available for subscription to train your #chatbot via #snowflake #AWS #googleanalytics #alibabacloud.#annotateddata
1
0
3
@c_fuscovirens
collema fuscovirens
2 months
OKLIPS v0.5 shipped!. What we have: .• Unique NFT generation from batch of PNG files.• Optional rarity weighting.• Feedback on how many unique pieces can be generated from the dataset.• Download each piece as PNG, it's JSON metadata or everything ZIPped. It's quite OK
Tweet media one
Tweet media two
2
4
16
@ikeri0
Ikerio
7 months
Working on creating the new dataset for Egregore's 7B fine tuned model. Just specify the model and the script will create a JSON file that has example questions and answers for the fine tuning process.Local data set generation and fine tuning made easy. Thanks to @EgregoreA66341
Tweet media one
2
5
11
@steren
Steren
1 year
By the way, I wrote a script to transform the GCP Locations page to a JSON file:.- Webpage: - JSON: - script: This is not an official dataset, but feel free to use the script / dataset.
Tweet media one
0
1
24
@NousResearch
Nous Research
11 months
Introducing a new open dataset release, Hermes Function Calling V1, the datamix that gave Hermes 2 Pro its tool use and structured output capabilities. HuggingFace Repo: The dataset includes single and multiturn Function Calling and Structured JSON
Tweet media one
18
89
619
@nimbus_696
☁️nhimbus☁️
2 months
🚀 SIWA Testnet is LIVE! . Ready to snag $SHRED? Just follow these simple steps:. 1️⃣ Get test tokens: 2️⃣ Connect your wallet: 3️⃣ Set up your profile.4️⃣ Upload any dataset (JSON, CSV or TXT).5️⃣ Register & mint your dataset.6️⃣ all set
1
0
2
@GoodluckDike3
Goodluck Dike
10 months
Our goal is to annotate and label 5,000 selected images from our collection of over 40,000 at @UiLandDesign using the VGG Image Annotator. Once annotated, we will export the dataset as JSON files, which will then be used to train machine learning models for UI element recognition
Tweet media one
@GoodluckDike3
Goodluck Dike
10 months
This is the first of two Machine Learning features @codewarsfx and I are developing :. 1. Search by Text on Images. 2. UI Element Detection and Automated Image Tagging: Identification of UI elements within images, automatically tagging them based on the components found.
0
1
3
@Sahara_VietNam
SAHARA AI VIETNAM 🔆 🇻🇳
2 months
6 ⚙️ Cách bắt đầu cực kỳ đơn giản:. 1️⃣ Tải lên dataset (.json, .csv, .txt) trên Developer Portal. 2️⃣ Tokenize dataset bằng cách mint NFT (ERC-721) đại diện quyền sở hữu. 3️⃣ Dataset sẽ được ghi nhận một cách tự động trong Sahara Global Registry. ✅ Nào! bây
Tweet media one
1
0
0
@Cybersoulja
Kevin
1 year
Screwhead B has officially finished training on my custom #DJScrew JSON dataset and is now registered with OpenAI. They grow up so fast! sniff
Tweet media one
0
0
0
@omarsar0
elvis
1 year
An LLM for Structured Extraction. Looks like a good set of LLMs for structured extraction. It fine-tunes a phi-3-mini on a private high-quality synthetic dataset for information extraction. Very straightforward to use: provide input text and JSON template describing the
Tweet media one
6
62
310
@JagersbergKnut
Knut Jägersberg
2 years
The GUI is quite nice. Major inconvenience with the tool is you have to prepare the data in a json format it expects, which is not just hf dataset format. Wrote the json with a for loop. First time DPOing!
Tweet media one
0
0
1
@nurijanian
George from 🕹prodmgmt.world
9 months
Pretty sure you can analyse clickstream data with Claude, need to test this more, I used a synthetic dataset from ChatGPT. PROMPT: . You will be analyzing JSON data of a clickstream from several users. Your task is to examine this data for user journeys, similarities between
Tweet media one
1
0
9