dida_ML Profile Banner
dida Profile
dida

@dida_ML

Followers
82
Following
34
Media
46
Statuses
122

dida is pushing to bring AI-powered software solutions to the broad industry. On twitter, we'll share learnings from our involvement with leading edge research.

Berlin - Germany
Joined April 2021
Don't wanna be here? Send us removal request.
@dida_ML
dida
5 months
In our last reading group, dida’s Machine Learning Scientists discussed the paper ( from deepseek about the DeepSeek-R1 model and their approach of using reinforcement learning directly without prior fine-tuning.
0
0
1
@dida_ML
dida
8 months
OpenAI’s new o1 model boosts accuracy with chain-of-thought reasoning, excelling in complex tasks like IMO challenges (83% vs. 13% for GPT-4o) and competitive programming. Here you can read our summary for the o1 model: #AI #OpenAI #ML #innovation
Tweet media one
0
0
0
@dida_ML
dida
10 months
NVIDIA Shrinks LLama3.1 8B to 4B with Pruning and Distillation. NVIDIA's latest research reduces LLama3.1 8B to 4B parameters by pruning 50% of its layers. Retrained with 40x fewer tokens, the model sees a 16% MMLU score boost. Read more: #AI #research.
0
0
0
@dida_ML
dida
11 months
We would like to present a paper on Physics-Informed Neural Networks (PINNs) that use physical laws to improve neural network training. It features a recent PyTorch implementation demonstrating PINNs by calculating gravity from thrown ball data. #ML.
0
0
1
@dida_ML
dida
11 months
We recently examined MambaVision, a new backbone combining Vision Transformers (ViT) with Mamba. This integration boosts scores and performance for tasks like object detection and instance segmentation. More details here: #machinelearning #tech #ML #AI.
0
0
2
@dida_ML
dida
1 year
Choosing the Right GPT Model for Your Business. For processing 4,500 documents per month:. ◾ It could either spend 126 EUR for the overall task or up to 7.150 EUR .◾ GPT-3.5 is 50 times cheaper than GPT-4 .◾ GPT-4o is still 5 times cheaper than GPT-4. #AI #LLM #GPT #TECH4ALL
Tweet media one
Tweet media two
0
0
0
@dida_ML
dida
1 year
Visual Instruction Tuning. Learn about "Visual Instruction Tuning": GPT-4 generates multimodal instruction tracking data. ViT-L/14 integration with Vicuna enables image comprehension and conversational applications. Read more:
0
0
0
@dida_ML
dida
1 year
dida will be at #ICLR2024 with two papers in Vienna this year. Check them out in case you are there.
@lorenz_richter
Lorenz Richter
1 year
On my way to #ICLR2024. I'm looking forward to interesting discussions. Let me know if you're up for a coffee!. Wed: Fast and unified path gradient estimators for normalizing flows (.Thu: Improved sampling via learned diffusions (.
0
0
4
@dida_ML
dida
2 years
We are thrilled to announce the launch of our brand-new website design!. We aimed to create a user-friendly, informative, and visually appealing platform for our blog readers, customers, and partners. Take a look at our new website design at
Tweet media one
2
0
0
@dida_ML
dida
2 years
follow us on X for more interesting reading group posts: @dida_ML.
0
0
0
@dida_ML
dida
2 years
paper:
1
0
0
@dida_ML
dida
2 years
It seems that just by comparing predictions of a network to those from a weighted average of its past parameters you can produce information from almost nothing.
1
0
0
@dida_ML
dida
2 years
This week in our reading group we covered #BYOL. A very unusual form of self-supervised learning that seems to learn reasonable representations almost from nothing!
Tweet media one
1
0
1
@dida_ML
dida
2 years
𝐆𝐢𝐭𝐇𝐮𝐛 𝐒𝐨𝐮𝐫𝐜𝐞 𝐂𝐨𝐝𝐞: 𝐑𝐞𝐬𝐞𝐚𝐫𝐜𝐡 𝐏𝐚𝐩𝐞𝐫: Follow us on X for more interesting #ML papers that you can practically apply to your projects.
0
0
0
@dida_ML
dida
2 years
The network's code and trained weights are publicly accessible, signaling its readiness for integration and further exploration in various real-world applications. #AI.
1
0
0
@dida_ML
dida
2 years
This approach allows it to manage occlusion and non-repeatable key points adeptly. It is trained end-to-end, enabling it to learn significant priors for pose estimation from a comprehensive annotated dataset, furthering its ability to reason about the 3D scene and assignments.
Tweet media one
1
0
0
@dida_ML
dida
2 years
The paper presents SuperGlue, a neural network designed for efficient feature matching. It determines correspondences and filters out non-matchable points, utilizing a graph neural network and attention mechanisms to solve an optimal transport problem.
1
0
1
@dida_ML
dida
2 years
𝐒𝐮𝐩𝐞𝐫𝐆𝐥𝐮𝐞: In our last reading group we discussed a paper called "#SuperGlue: Learning Feature Matching with Graph Neural Networks"
Tweet media one
1
0
1
@dida_ML
dida
2 years
Celebrating 5 years with the dida-conference!🎂 . We had an amazing day filled with machine learning (#ML) talks, panels, and workshops featuring top organizations. We captured some moments of it and turned them into a short video. 𝐯𝐢𝐝𝐞𝐨:
0
1
3
@dida_ML
dida
2 years
Link to the paper: For more summaries of interesting research papers, make sure to follow us here on 𝕏 so you don't miss out on any updates. #AI #ML.
0
0
0