Aslan @avatsaev X Profile

Aslan

@avatsaev

Followers

609

Following

2K

Media

606

Statuses

5K

Software engineer working with web technologies, data, and AI.

France

Joined February 2011


Aslan

@avatsaev

3 months

Engineer's Guide to running Local LLMs with #llamacpp on Ubuntu, @Alibaba_Qwen Coder 30B running locally along with QwenCode in your terminal https://t.co/DZC9C25YUO

dev.to

Introduction: In this write-up I will share my local AI setup on Ubuntu that I use for my...

0

Ahmad

@TheAhmadOsman

3 months

the Qwen bros COOKED with this one
Qwen3 Next 80B-A3B
> DeepSeek V3.1-level intelligence
> at less than half the size
extremely smart & efficient
> 256k context
> instruct & thinking variants
> 3B active params per token (3.8%)
run this at home on 8x RTX 3090s
Buy a GPU ;)

16

12

166

Aslan

@avatsaev

5 months

Spoiler alert: the problem was --flash-attn

0

Aslan

@avatsaev

5 months

Qwen3-30B-A3B-Instruct repetition problem when running in llamacpp. My sampling params: `--ctx-size 4000 --temp 0.7 --top-p 0.8 --top-k 20 --min-p 0 --repeat-penalty 1.05 --presence-penalty 1.05 --flash-attn --mlock` Did anyone manage to fix this? I'm running a quantized version in

1

0

Aslan

@avatsaev

5 months

Locally running the new #qwen3 instruct model (via #llamacpp), in a locally running chat UI, with locally running #MCP servers and a locally hosted search engine. The speeds here are insane 🚀 The new @Alibaba_Qwen A3B Instruct is a very impressive model, congrats to the team!

0

4

Aslan

@avatsaev

9 months

Data intelligence day in Paris @databricks

0

2

Aslan

@avatsaev

1 year

https://t.co/f6RLFv1ynU

anthropic.com

Explore how Anthropic enhances AI systems through advanced contextual retrieval methods. Learn about our approach to improving information access and relevance in large language models.

0

1

Aslan

@avatsaev

1 year

Large Enough | Mistral AI | Frontier AI in your hands

mistral.ai

Today, we are announcing Mistral Large 2, the new generation of our flagship model. Compared to its predecessor, Mistral Large 2 is significantly more capable in code generation, mathematics, and...

0

Aslan

@avatsaev

2 years

https://t.co/4VaKR7hSQS

0

FFmpeg

@FFmpeg

2 years

The xz fiasco has shown how a dependence on unpaid volunteers can cause major problems. Trillion dollar corporations expect free and urgent support from volunteers. @Microsoft @MicrosoftTeams posted on a bug tracker full of volunteers that their issue is "high priority"

192

3K

15K

Aslan

@avatsaev

2 years

Yi model paper - A comprehensive guide on training and finetuning robust models. In an age where detailed methodologies are often withheld, this paper offers deep insights into data collection, filtration, mixing, and the processing of SFT data https://t.co/n4piyWXoIX #ai #ml

arxiv.org

We introduce the Yi model family, a series of language and multimodal models that demonstrate strong multi-dimensional capabilities. The Yi model family is based on 6B and 34B pretrained language...

0

AI at Meta

@AIatMeta

2 years

Today we’re releasing Code Llama 70B: a new, more performant version of our LLM for code generation — available under the same license as previous Code Llama models. Download the models ➡️ https://t.co/fa7Su5XWDC • CodeLlama-70B • CodeLlama-70B-Python • CodeLlama-70B-Instruct

163

1K

5K

Aslan

@avatsaev

2 years

https://t.co/YSQrVxpIgG

0

2

Aslan

@avatsaev

2 years

https://t.co/KYhKIG3FXH

kaggle.com

Explore and run machine learning code with Kaggle Notebooks | Using data from multiple data sources

0

Aslan

@avatsaev

2 years

https://t.co/VuFaCjjdfq

philschmid.de

Learn how to train LLaMa 2 using QLoRA and Hugging Face Transformers on Amazon SageMaker

0

Aslan

@avatsaev

2 years

https://t.co/cGpGSzPpmA #llm

github.com

An awesome & curated list of best LLMOps tools for developers - tensorchord/Awesome-LLMOps

0

Aslan

@avatsaev

2 years

https://t.co/eb2noA4dc2 by @CircuitLab

0

Deno

@deno_land

2 years

A Next.js app requires dozens of config files — next.config.js, eslintrc.json, tsconfig.json, package.json, postcss.config.js, tailwind.config.js, and more. How did we get here? How do we avoid it? https://t.co/3UogVY7OhC