Mailing List Archive

Support open source code!


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [tlug] A tool for fidning the total size of a web page?



>>>>> "A" == A Sajjad Zaidi <A.Sajjad> writes:

    A> How about this:

    A> wget -O - -r -l1 www.google.com 2>&1 | tail -1 | awk '{ print $2 }'

This is broken in several ways, especially -r -l1 which will get any
linked pages as well.  This will give a very wrong answer if you have
a short index of rfc*.txt, for example.  ;-)

wget -O - -p $url | wc

looks like a winner to me.

Hrvoje sez:

           Actually, to download
           a single page and all its requisites (even if they
           exist on separate websites), and make sure the lot
           displays properly locally, this author likes to use a
           few options in addition to -p:

                   wget -E -H -k -K -p http://I<site>/I<document>



-- 
Institute of Policy and Planning Sciences     http://turnbull.sk.tsukuba.ac.jp
University of Tsukuba                    Tennodai 1-1-1 Tsukuba 305-8573 JAPAN
              Don't ask how you can "do" free software business;
              ask what your business can "do for" free software.


Home | Main Index | Thread Index

Home Page Mailing List Linux and Japan TLUG Members Links