Ines Montani
@_inesmontani
Followers
21K
Following
6K
Media
671
Statuses
4K
⚠️ Inactive. Bluesky: https://t.co/zwvba933lq LinkedIn: https://t.co/gXMmVHWM1m
Joined May 2008
My video on @spacy_io layout is now out! This is probably my favorite update from @explosion_ai (and that's saying something!) This package makes it simple to do region detection, table detection, and OCR with just 1 line of Python. #MachineLearning
5
12
46
Going live now! Join me for the first pilot stream on YouTube.
Trying something new this year: I'm going to stream spaCy and other open-source development and maintenance twice a week! I hope this will give the community a better inside look while things are in progress. I'll also answer your questions and chat about NLP, OSS and more.
0
4
17
This is such a cool idea! Get an inside look into the life and work of an open-source developer and chat about NLP and more. I'll probably hang out in the chat for a bit as well 💙
Trying something new this year: I'm going to stream spaCy and other open-source development and maintenance twice a week! I hope this will give the community a better inside look while things are in progress. I'll also answer your questions and chat about NLP, OSS and more.
0
1
23
Writing a new talk on "What the history of the web can teach us about the future of AI" 🔮 Wanted to do this for so long because I think there are some great lessons & analogies here. I'll be presenting it for my keynote at @PyConWeb in Berlin on Jan 25 – hope to see you there!
2
6
30
🚀 Excited to introduce our keynote speaker, @_inesmontani , who will open the conference day with her talk, "What can the history of the web teach us about the future of AI?" 💫 Delighted to welcome you to PyCon+Web 2025, Ines! Join us for a day featuring 15+ speakers across
1
4
11
Thank you @_inesmontani for building spaCy , it tells us that , NLP isn't only deep learning , transformers and LLMs. It's so useful for certain tasks all at one place , more imp - production friendly. spaCy have most things all at one place and reminds - not to use sword for a
4
3
44
If you're using Prodigy and want to beta test it, check out this post for more details:
support.prodi.gy
Hey everyone! 👋 We're happy to introduce a new Prodigy plugin we've been working on that's now available for beta testing. Prodigy Dashboard adds a new command dashboard that starts a web applicat...
0
0
2
New plugin coming to Prodigy soon: a visual data dashboard! 📊✨ Manage, view and filter annotations and access data analytics and progress all in a neat web app. (Link for beta testing in thread!)
2
3
21
👩🏫 12 conferences, 5 keynotes, 13 talks 🎙️ 4 podcast interviews, 3 panel discussions ✍️ 11 blog posts, 4 personal posts, 3 case studies 🔮 focus of my work and vision 🖤 personal, podcasts, TV, books, music and concerts https://t.co/LKJuQnONnY
2
0
7
Happy Birthday to me! 🥳 It feels like a good opportunity to look back at 2024, an eventful but also difficult year for me. So here's my personal review, including travel, talks, writing and various things I did and enjoyed.
12
3
54
Looking forward to joining the panel on PyLadies entrepreneurs and career development at @PyLadiesCon on Saturday 💖 It's a fully online conference with many cool people and talks, and you can still register!
1
1
5
Including: ▪️ why PDFs are a bad "source of truth" ▪️ implementation with @spacy_io and Docling ▪️ working with tabular data ▪️ end-to-end information extraction workflow ▪️ data collection and training Read more:
explosion.ai
This blog post presents a new modular workflow for converting PDFs and similar documents to structured data and shows you how to build end-to-end document understanding and information extraction...
0
5
32
New post: From PDFs to AI-ready structured data 📃✨ A deep dive into document processing, layout analysis and a modular workflow for building end-to-end document understanding and information extraction pipelines using PDFs, Word documents, scans and more.
4
44
210
The nice thing about it is that you can control how the table should be represented in the document text, e.g. for further information extraction and to make it easier for the model to handle. That's what I want to play with next, including rephrasing tables in natural language.
1
1
5
Many of you have been asking about PDF table extraction and I finally got around to experimenting with it 👀 Here's tabular data converted with Docling + TableFormer, anchored within the document text and accessible as a pandas.DataFrame:
7
20
124
$ pip install spacy-layout Code, examples and documentation: https://t.co/zXZWSTjgF1
github.com
📚 Process PDFs, Word documents and more with spaCy - explosion/spacy-layout
0
0
5
The first version of my spaCy + Docling integration is here: 📚 process PDFs, Word documents & more 📝 structured text-based output via @spacy_io's Doc 🏷 layout spans for sections, headings etc. 🔮 apply NLP pipelines to PDFs ✂️ chunk your data for RAG
3
12
67