
Michel
@mike_pavlukhin
Followers
647
Following
3K
Media
246
Statuses
827
ai, experiments, games and e.t.c Check my new game https://t.co/XOOJs2fq0C
Joined March 2015
I launched Timeboat Adventures, an experimental narrative game where you rescue historical figures and merge them into a single meta-entity that rewrites the 20th century. Powered by @DSPyOSS and Gemini-2.5-Flash from @googleaidevs. Try it here https://t.co/4OeKHeThph
#AI #LLM
6
8
56
Useful DSPy tip: You can return feedback in the reward function for dspy.Refine. This gives you more control over your generations. I wish this information were included in the DSPy documentation. It’s one of my most-used modules when I don’t have time to optimize programs.
1
1
5
I’d love to hear your thoughts! I’m planning to write more tutorials and share write-ups about my experiments. What topics would you be most interested in reading about?
0
0
3
My first tutorial on creative task optimization with DSPy and GEPA is online! Not everything went as expected, but it’s a solid start toward better results. https://t.co/3UaaUCCcCB
Finished my first tutorial on improving the quality of smaller LLMs for creative tasks using a teacher + GEPA. I'll publish it on Monday. What style of signature do you prefer? I usually go with classes, but for this tutorial I decided to use inline
5
38
231
and it sucks that substack doesn't have good syntax highlighting for code snippets
0
0
4
Caught a fever and sore throat earlier this week, so the tutorial got delayed. Feeling much better now and back to work - wrapping it up soon!
4
1
61
It feels like a heuristic, but if you tell an LLM in the system prompt that the text was generated by another LLM, it tends to evaluate more strictly and accurately. no more “you did a good job, for a human being”.
1
2
14
It’s kinda sad that there’s no easy way to access hidden states in DSPy. The dependency on LiteLLM is quite limiting when working with local LLMs. I might implement a custom handler for vLLM to test CLUE. it seems like an awesome alternative to the LLM-as-a-judge approach.
Verifier with NO trained parameters?🤯 Even outperform GPT-4o in terms of verification accuracy. We at Tencent AI Lab introduce CLUE - 🕵️ a verifier based on clustering where successful vs. failed reasoning creates separate hidden states. Paper link: https://t.co/ufELmcDgZc
1
2
62
Finished my first tutorial on improving the quality of smaller LLMs for creative tasks using a teacher + GEPA. I'll publish it on Monday. What style of signature do you prefer? I usually go with classes, but for this tutorial I decided to use inline
9
10
204
Personally, I'd still prefer a plain UML diagram without any artifacts and piss filter. But anyway What's your approach to creating diagrams?
1
0
2
Here is the final result. I spent more time writing this thread than creating the diagram
1
0
1
Pro tip: Paste your MermaidJS code along with the image to Nano Banana This helps it understand the structure better and slightly improves output quality
1
0
0
3. Go to Google's AI Studio and give your diagram to Nano Banana Use a prompt like: "Generate a diagram based on the schema's content and structure, but with a cool sci-fi aesthetic"
1
0
0
So here is the better way 1. Ask ChatGPT to generate a MermaidJS diagram based on your tutorial/concept
1
0
0
Here is the problem: Current image generation models simply can't create coherent technical diagrams. They're just disfigured abominations with no logic and sense.
1
0
0
I hate AI-generated diagrams in blogs They're confusing, ugly, unreadable When I see one, I want to close tutorial immediately I get it, sometimes we don't have enough time So, here's a mini-tutorial on how to generate much better diagrams with Nano Banana in 5 minutes🧵
1
0
4
I GAVE MY AGENTS TOOLS! in case of ASCII art generation, agents now can render text to png. It feels like that helps them make better analyses and improves quality. it's interesting that gemini-2.5-flash-lite uses tools more often than 2.5-pro. Pro prefer just thinking.
Anyone interested in implementation of a multi-agent debate system with DSPy? rn I'm updating my optimization scripts for GEPA. What I love about DSPy is how it easy to optimize the whole multi-agent system at once. no need to tune each agent separately. on Github soon
0
2
20
I want to build a cool visualization project: a TV-debate show with different agent/persona/LLMs streaming 24/7. Short on time/resources - would love help from anyone who would like to support the project. Please DM me!
1
0
10
Anyone interested in implementation of a multi-agent debate system with DSPy? rn I'm updating my optimization scripts for GEPA. What I love about DSPy is how it easy to optimize the whole multi-agent system at once. no need to tune each agent separately. on Github soon
8
22
174