TLUG Mailing List

Mailing List Archive

tlug.jp Mailing List tlug archive tlug Mailing List Archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [tlug] Anyone alive out here ?

Date: Sun, 8 Sep 2024 12:12:29 +0900

From: Chris Salisbury <chris.salisbury@example.com>

Subject: Re: [tlug] Anyone alive out here ?

References: <f03af814-4ec7-6b38-59fb-3c1851faf7a8@gmail.com> <2d9532be-b1af-42d1-a8cd-8eae13f9f9d5@codewiz.org> <f48593f1-8a3a-440d-9b29-33199b6dcef2@dcook.org> <f6e0bde9-1926-495e-a8c3-14e80d7a0203@codewiz.org>

>>> What's the current state of the art? Interfacing models with text has big limits...
>> Quite closely related, I've been wondering what the state of the art for open-source OCR is, particularly of Japanese text.
> I'm waiting for the first Llama-like LLM with image recognition similar to ChatGPT.

Not sure if "similar to ChatGPT" precludes the much worse performance of the "Llama-like" models at https://ollama.com/search?c=vision, but I've been impressed with llava-phi3 for a model small enough to run on a phone that I got for free with a 2-month contract 3 years ago. I do this on (unrooted) android 11 with termux->proot->ollama. But I've never tried OCR with it, Japanese or otherwise.

References:

Re: [tlug] Anyone alive out here ?
From: J. Hart

Re: [tlug] Anyone alive out here ?
From: Bernie Innocenti

Re: [tlug] Anyone alive out here ?
From: Darren Cook

Re: [tlug] Anyone alive out here ?
From: Bernie Innocenti

Prev by Date: Re: [tlug] Anyone alive out here ?

Next by Date: Re: [tlug] [announcement] September 14th Technical meeting

Previous by thread: Re: [tlug] Anyone alive out here ?

Next by thread: Re: [tlug] Call for presenters Sept 14th

Index(es):

Date

Thread

Home | Main Index | Thread Index