dida @dida_ML X Profile

dida

@dida_ML

Followers

82

Following

34

Media

46

Statuses

122

dida is pushing to bring AI-powered software solutions to the broad industry. On twitter, we'll share learnings from our involvement with leading edge research.

Berlin - Germany

Joined April 2021

Don't wanna be here? Send us removal request.

dida

@dida_ML

5 months

In our last reading group, dida’s Machine Learning Scientists discussed the paper ( from deepseek about the DeepSeek-R1 model and their approach of using reinforcement learning directly without prior fine-tuning.

0

1

dida

@dida_ML

8 months

OpenAI’s new o1 model boosts accuracy with chain-of-thought reasoning, excelling in complex tasks like IMO challenges (83% vs. 13% for GPT-4o) and competitive programming. Here you can read our summary for the o1 model: #AI #OpenAI #ML #innovation

0

dida

@dida_ML

10 months

NVIDIA Shrinks LLama3.1 8B to 4B with Pruning and Distillation. NVIDIA's latest research reduces LLama3.1 8B to 4B parameters by pruning 50% of its layers. Retrained with 40x fewer tokens, the model sees a 16% MMLU score boost. Read more: #AI #research.

0

dida

@dida_ML

11 months

We would like to present a paper on Physics-Informed Neural Networks (PINNs) that use physical laws to improve neural network training. It features a recent PyTorch implementation demonstrating PINNs by calculating gravity from thrown ball data. #ML.

0

1

dida

@dida_ML

11 months

We recently examined MambaVision, a new backbone combining Vision Transformers (ViT) with Mamba. This integration boosts scores and performance for tasks like object detection and instance segmentation. More details here: #machinelearning #tech #ML #AI.

0

2

dida

@dida_ML

1 year

Choosing the Right GPT Model for Your Business. For processing 4,500 documents per month:. ◾ It could either spend 126 EUR for the overall task or up to 7.150 EUR .◾ GPT-3.5 is 50 times cheaper than GPT-4 .◾ GPT-4o is still 5 times cheaper than GPT-4. #AI #LLM #GPT #TECH4ALL

0

dida

@dida_ML

1 year

Visual Instruction Tuning. Learn about "Visual Instruction Tuning": GPT-4 generates multimodal instruction tracking data. ViT-L/14 integration with Vicuna enables image comprehension and conversational applications. Read more:

0

dida

@dida_ML

1 year

dida will be at #ICLR2024 with two papers in Vienna this year. Check them out in case you are there.

Lorenz Richter

@lorenz_richter

1 year

On my way to #ICLR2024. I'm looking forward to interesting discussions. Let me know if you're up for a coffee!. Wed: Fast and unified path gradient estimators for normalizing flows (.Thu: Improved sampling via learned diffusions (.

0

4

dida

@dida_ML

2 years

We are thrilled to announce the launch of our brand-new website design!. We aimed to create a user-friendly, informative, and visually appealing platform for our blog readers, customers, and partners. Take a look at our new website design at

2

0

dida

@dida_ML

2 years

follow us on X for more interesting reading group posts: @dida_ML.

0

dida

@dida_ML

2 years

paper:

1

0

dida

@dida_ML

2 years

It seems that just by comparing predictions of a network to those from a weighted average of its past parameters you can produce information from almost nothing.

1

0

dida

@dida_ML

2 years

This week in our reading group we covered #BYOL. A very unusual form of self-supervised learning that seems to learn reasonable representations almost from nothing!

1

0

1

dida

@dida_ML

2 years

𝐆𝐢𝐭𝐇𝐮𝐛 𝐒𝐨𝐮𝐫𝐜𝐞 𝐂𝐨𝐝𝐞: 𝐑𝐞𝐬𝐞𝐚𝐫𝐜𝐡 𝐏𝐚𝐩𝐞𝐫: Follow us on X for more interesting #ML papers that you can practically apply to your projects.

0

dida

@dida_ML

2 years

The network's code and trained weights are publicly accessible, signaling its readiness for integration and further exploration in various real-world applications. #AI.

1

0

dida

@dida_ML

2 years

This approach allows it to manage occlusion and non-repeatable key points adeptly. It is trained end-to-end, enabling it to learn significant priors for pose estimation from a comprehensive annotated dataset, furthering its ability to reason about the 3D scene and assignments.

1

0

dida

@dida_ML

2 years

The paper presents SuperGlue, a neural network designed for efficient feature matching. It determines correspondences and filters out non-matchable points, utilizing a graph neural network and attention mechanisms to solve an optimal transport problem.

1

0

1

dida

@dida_ML

2 years

𝐒𝐮𝐩𝐞𝐫𝐆𝐥𝐮𝐞: In our last reading group we discussed a paper called "#SuperGlue: Learning Feature Matching with Graph Neural Networks"

1

0

1

dida

@dida_ML

2 years

Celebrating 5 years with the dida-conference!🎂 . We had an amazing day filled with machine learning (#ML) talks, panels, and workshops featuring top organizations. We captured some moments of it and turned them into a short video. 𝐯𝐢𝐝𝐞𝐨:

0

1

3

dida

@dida_ML

2 years

Link to the paper: For more summaries of interesting research papers, make sure to follow us here on 𝕏 so you don't miss out on any updates. #AI #ML.

0