TLUG Mailing List

Mailing List Archive

tlug.jp Mailing List tlug archive tlug Mailing List Archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [tlug] Anyone alive out here ?

Date: Thu, 5 Sep 2024 12:58:01 +0100

From: Darren Cook <darren@example.com>

Subject: Re: [tlug] Anyone alive out here ?

References: <f03af814-4ec7-6b38-59fb-3c1851faf7a8@gmail.com> <2d9532be-b1af-42d1-a8cd-8eae13f9f9d5@codewiz.org>
Later, I installed ollama and blogged about my experience with a fewfreely available LLMs:
   https://mstdn.io/@codewiz/112527717194517544
That was interesting. Which AMD GPU are you using?
Do you have a frontend speech-to-text model that spits out text as inputto a regular LLM, then feed the output to a TTS model? ...
...
What's the current state of the art? Interfacing models with text hasbig limits...
Quite closely related, I've been wondering what the state of the art foropen-source OCR is, particularly of Japanese text.
All the links go to Tesseract (or wrappers around it), which is simplynot good enough, even for English. Or to online tools where you uploadyour private data, and pay for the privilege.
This could then lead on to the greatest unsolved computing challenge ofthe 21st century, which is a PDF to text converter. Yes, yes, I know toyexamples work. I mean PDFs that were made in Microsoft Word, containmultiple columns and figures, and inset boxes, like a magazine, or acompany's annual report. And extracting all the sections with headings,in a reasonable reading order (suitable for, say, a screen reader).
I thought Google, OpenAI, etc. had solved it, as they train the LLMs onPDFs, and ChatGPT can be given a PDF and answer questions. But, as faras I've been able to find out, they either feed raw PDF bytes in, andhope, or they use pdf2text, and hope.
Darren
Follow-Ups:

Re: [tlug] PDF to text converter (was: Anyone alive out here ?)
From: Jim Blackson

Re: [tlug] Anyone alive out here ?
From: Bernie Innocenti

References:

Re: [tlug] Anyone alive out here ?
From: J. Hart

Re: [tlug] Anyone alive out here ?
From: Bernie Innocenti

Prev by Date: Re: [tlug] Anyone alive out here ?

Next by Date: Re: [tlug] PDF to text converter (was: Anyone alive out here ?)

Previous by thread: Re: [tlug] Anyone alive out here ?

Next by thread: Re: [tlug] PDF to text converter (was: Anyone alive out here ?)

Index(es):

Date

Thread

Home | Main Index | Thread Index