Aslan

@avatsaev

Followers
609
Following
2K
Media
606
Statuses
5K

Software engineer working with web technologies, data, and AI.

France
Joined February 2011
@avatsaev
Aslan
3 months
Engineer's Guide to running Local LLMs with #llamacpp on Ubuntu, @Alibaba_Qwen Coder 30B running locally along with QwenCode in your terminal https://t.co/DZC9C25YUO
dev.to
Introduction In this write up I will share my local AI setup on Ubuntu that I use for my...
0
0
0
@TheAhmadOsman
Ahmad
3 months
the Qwen bros COOKED with this one Qwen3 Next 80B-A3B > DeepSeek V3.1-level intelligence > at less than half the size extremely smart & efficient > 256k context > instruct & thinking variants > 3B active params per token (3.8%) run this at home on 8x RTX 3090s Buy a GPU ;)
16
12
166
@avatsaev
Aslan
5 months
Spoiler alert: the problem was `--flash-attn`
0
0
0
@avatsaev
Aslan
5 months
Qwen3-30B-A3B-Instruct repetition problem when running in llamacpp. My sampling params: `--ctx-size 4000 --temp 0.7 --top-p 0.8 --top-k 20 --min-p 0 --repeat-penalty 1.05 --presence-penalty 1.05 --flash-attn --mlock` Did anyone manage to fix this? I'm running a quantized version in
1
0
0
@avatsaev
Aslan
5 months
Locally running new #qwen3 instruct model (via #llamacpp), in a locally running chat UI, with locally running #MCP servers and locally hosted search engine, the speeds here are insane 🚀 The new @Alibaba_Qwen A3B Instruct is a very impressive model, congrats to the team!
0
0
4
@avatsaev
Aslan
9 months
Data intelligence day in Paris @databricks
0
0
2
@FFmpeg
FFmpeg
2 years
The xz fiasco has shown how a dependence on unpaid volunteers can cause major problems. Trillion dollar corporations expect free and urgent support from volunteers. @Microsoft @MicrosoftTeams posted on a bug tracker full of volunteers that their issue is "high priority"
192
3K
15K
@avatsaev
Aslan
2 years
Yi model paper - A comprehensive guide on training and finetuning robust models. In an age where detailed methodologies are often withheld, this paper offers deep insights into data collection, filtration, mixing, and the processing of SFT data https://t.co/n4piyWXoIX #ai #ml
arxiv.org
We introduce the Yi model family, a series of language and multimodal models that demonstrate strong multi-dimensional capabilities. The Yi model family is based on 6B and 34B pretrained language...
0
0
0
@AIatMeta
AI at Meta
2 years
Today we’re releasing Code Llama 70B: a new, more performant version of our LLM for code generation — available under the same license as previous Code Llama models. Download the models ➡️ https://t.co/fa7Su5XWDC • CodeLlama-70B • CodeLlama-70B-Python • CodeLlama-70B-Instruct
163
1K
5K
@deno_land
Deno
2 years
A Next.js app requires dozens of config files — next.config.js, eslintrc.json, tsconfig.json, package.json, postcss.config.js, tailwind.config.js, and more. How did we get here? How do we avoid it? https://t.co/3UogVY7OhC
deno.com
Why a Next.js project has over 30 configuration files and what we can do to avoid it.
17
58
347