TLUG Mailing List

Mailing List Archive

tlug.jp Mailing List tlug archive tlug Mailing List Archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [tlug] Are ordered hashes useful?

Date: Wed, 16 Feb 2011 11:59:05 +0900

From: Raymond Wan <rwan.kyoto@example.com>

Subject: Re: [tlug] Are ordered hashes useful?

References: <4D4E75AB.5060703@example.com> <4D4FB368.4010403@example.com> <AANLkTimk8AUJMRiwbPjGfDfciCwfaQr9_o3vBXtW2+rW@example.com> <87lj1rza76.fsf@example.com> <AANLkTikqy8G6G1Pt3ramSqxKHKJHSpo=zjHYLVENW0Gb@example.com> <878vxqyfok.fsf@example.com> <AANLkTiknPv3VsKzKQVxMacFE9i9aSSmwXDU4Fz+rZU3g@example.com> <AANLkTi=-0DXwOzc4yOriXptsvXkNZGh71Jgm=Rzu_2tW@example.com> <87y65iqchr.fsf@example.com> <4D59EC1B.9040403@example.com> <AANLkTinGHv=QWz++VYMLAWHZsLt1vUm0Nd5u_n2Jsaj_@example.com>

User-agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.1.16) Gecko/20101226 Icedove/3.0.11
Hi,


On 16/02/11 01:47, Marty Pauley wrote:
On Tue, Feb 15, 2011 at 11:59 AM, Raymond Wan<rwan.kyoto@example.com>  wrote:
Yes, I don't agree that dictionary order and ordered list
are the same.  Ordered list access could be provided by a
tree, but to enable dictionary access, you either need to
augment the tree with a hash table or build two trees in
parallel.
Don't say that too loudly.  All the databases in the world may stop working!!!

Maybe by "ordered list" you mean the insertion order, which is where
this thread started.  I'm talking about sorted key order, or
"dictionary order" as you mentioned.
Yes, I guess this discussion has branched off into manythings -- which is good to see. I've been talking about theorder in which data is inserted and I presume that is alsowhat you meant by "ordered list".
Quite frankly, I don't know the term "dictionary order".I've heard of it in the context of people talking aboutPerl, etc. but when I learned about algorithms (way way back:-) ), I don't recall anyone using the phrase "dictionary"[unless in some applications of algorithms...]. As far as Ican remember :-), not in general algorithms.
But when I studied CS, the word "Web" was only in researchpapers and not in newspapers. Don't get me started onFacebook. :-D
But my main point is that trees do provide dictionary access.  That's
what they're for.

In pseudocode:

   t = new Tree
   t.insert("banana" =>  "yellow")
   t.insert("carrot" =>  "orange")
   t.insert("apple" =>  "red")
   t.insert("durian" =>  "smelly")
   t.keys == {"apple","banana","carrot","durian"} # ordered access
   t.get("carrot") == "orange" # fast lookup

Most large dictionaries use trees, not hashes.  The whole point of a
tree is that it provides fast lookup O(log n) and ordered access.
Hashes don't cope well with a large amount of data (unless it's known
in advance).
Yes, you are correct but let's clarify things a bit further.
Trees give you O(log n) access but to the keys (and thustheir records) only. This is true for databases as well.Most people use some unique ID like a "social securitynumber" or passport number as the unique ID for records. Ifyou wanted to access anything else in the record (i.e., thatis not this unique ID), then you will need to build asecondary index on top of the data or do a linear searchthrough the data. I guess in SQL, it would be the "createindex" command -- but it's been a while since I used SQL.
Even in databases, I guess this secondary index would be atree; but if you knew of the properties of the data, Isuppose a hash could work...
As for hashes, they "can" work for large amount of dataprovided you have a way to deal with collisions (i.e., twodifferent things that hash to the same location). Whatmatters, I guess, is that hashing your inputs will give aroughly uniform distribution of hash locations. Sometimes,we don't know this beforehand and yes, trees are better then.
Of course, you can combine the two and have a "hash table oftrees". i.e., when two different things collide, you putthem in the tree at that location in the hash table...Perhaps you could also have a tree of hash tables...I'm notso sure that would be useful, though...
Ray
References:

[tlug] [Javascript] Shouldn't there be a sort option on objects
From: Dave M G

Re: [tlug] [Javascript] Shouldn't there be a sort option on objects
From: Raymond Wan

Re: [tlug] [Javascript] Shouldn't there be a sort option on objects
From: Josh Glover

Re: [tlug] [Javascript] Shouldn't there be a sort option on objects
From: Stephen J. Turnbull

Re: [tlug] [Javascript] Shouldn't there be a sort option on objects
From: Josh Glover

[tlug] Are ordered hashes useful?
From: Stephen J. Turnbull

Re: [tlug] Are ordered hashes useful?
From: Jun-Dai Bates-Kobashigawa

Re: [tlug] Are ordered hashes useful?
From: Marty Pauley

Re: [tlug] Are ordered hashes useful?
From: Stephen J. Turnbull

Re: [tlug] Are ordered hashes useful?
From: Raymond Wan

Re: [tlug] Are ordered hashes useful?
From: Marty Pauley

Prev by Date: Re: [tlug] [Lingo] Two direction full text search

Next by Date: [tlug] Docomo N2502 card on Linux

Previous by thread: Re: [tlug] Are ordered hashes useful?

Next by thread: Re: [tlug] Are ordered hashes useful?

Index(es):

Date

Thread

Home | Main Index | Thread Index

Home Page Mailing List Linux and Japan TLUG Members Links