Link grabber

  • Thread starter Thread starter Bert Hibberd
  • Start date Start date
B

Bert Hibberd

I'm looking for a freeware program that will extract all the
hyperlinks from any given page.

I've tried to search through Google, but can't locate anything. Yet
I'm sure that something does exist.

Can anyone help!

Thanks,

Bert Hibberd
Australia
 
Darrien said:
Try to find this download:
http://support.microsoft.com/default.aspx?scid=kb;en-us;198045

Search for "Links List" on that page.

I have had no problems getting it to work with IE 6.0 despite what it
says.

Really? From where did you download it? I checked the above site and
while it does list several add-ons
for IE and does mention Links.. the site at the bottom of the page takes
one to another page and the very
first sentence at the top of the page is....WE APOLOGIZE, THE SITE YOU
ARE LOOKING FOR NO LONGER EXISTS... so when you say "it works with IE
6.0 despite what it says" - what do you mean? What are you
talking about since this apparently no longer exists. I just tried it
about five minutes ago. It is a BIG NO-GO!
So what am I missing? Do you have another site? Thanks.

Helen
 
Try to find this download:
http://support.microsoft.com/default.aspx?scid=kb;en-us;198045

Search for "Links List" on that page.

I have had no problems getting it to work with IE 6.0 despite what it
says.

Here's where that takes you
to...http://www.microsoft.com/Windows/Ie/IE5/custom.asp
so just go here first and read..... Our Apologies...
The page you are looking for no longer exists..... then it lists
about 8 or 10 other areas
and LINKS isn't one of them.... then to this

http://www.microsoft.com/windows/ie/downloads/default.asp
then to
http://www.microsoft.com/windows/ie/downloads/addon/default.asp
then to
..... NOTHING! This no longer exists. So how long have you had it on
your
computer?

Helen
 
Really? From where did you download it?

http://www.microsoft.com/windows/ie/previous/webaccess/default.asp

I checked the above site and while it does list several add-ons
for IE and does mention Links.. the site at the bottom of the page takes
one to another page and the very
first sentence at the top of the page is....WE APOLOGIZE, THE SITE YOU
ARE LOOKING FOR NO LONGER EXISTS...

That's why I said "try to find this download"

I was too lazy to find it myself at the time of my original posting.
so when you say "it works with IE 6.0 despite what it says" - what do
you mean?

I never said that. Please don't attribute quotes to me that I have not
said.
What are you talking about since this apparently no longer
exists. I just tried it about five minutes ago. It is a BIG NO-GO!
So what am I missing? Do you have another site? Thanks.

Helen

Could you possibly change this to a proper .sig?

Change the "---" to "-- " (notice the space at the end)
 
Bert Hibberd said:
I'm looking for a freeware program that will extract all the
hyperlinks from any given page.

I've tried to search through Google, but can't locate anything. Yet
I'm sure that something does exist.

Can anyone help!

Thanks,

Bert Hibberd
Australia

Have you tried VisitURL? Check the options to customize.
http://www.tranglos.com/free/index.html

I haven't had need of this feature, but I have seen it mentioned when
looking at some other programs. At least one of the proxy servers
also offered this as part of its prefetching feature.

Please let us know what you found and which was most useful.

BillR
 

While at Son of Spy's site, I incidentally ran across LinkChaser.
While looking for URLs Out!, I also found URL Bandit (Xteq) and
WebRefs. I suspect that there are other programs that might meet your
needs listed as well: I only looked at part of two pages.

BillR

LinkChaser is a companion program of sorts to URLs Out! - or maybe the
antithesis... Basically you drop HTML files on it, it clips all the
links and makes a nice directory full of Internet Shortcuts for you.
Since MS already includes features to convert bookmark lists, the
intention of this program is really for a browser assistant - drop the
HTML source files on the program and all the shortcuts get churned out
without a lot of right-clicking. I've seen a few apps do this, but not
exactly in the same way. Needs VB5 runtimes AND MessageHook .dll
available HERE. (123Kb) Download 140Kb
http://www.sover.net/~whoi/Internet2.html
ftp://ftp.iif.hu/.ftp1/ftp.simtel.net/pub/simtelnet/win95/htmlmisc/linkchsr.zip


URL Bandit (Xteq) URL Bandit is a handy little program that grabs
Internet and email addresses from text sent to the Windows Clipboard.
The rather basic freeware system doesn't offer any bells and whistles,
but could be useful for gathering URLs for future rather than
immediate use while scanning links pages, ezines, newsletters,
messages, and other materials. It analyzes text sent to the Clipboard;
recognizes Web, FTP, and email addresses; extracts them; and adds them
to a list for future reference and use. Simply click on an entry to
launch your default browser or email program, or right-click to delete
the item. Your lists are automatically retained between sessions.
Download 439Kb
http://www.sover.net/~whoi/Internet4.html
http://www.xteq.com/downloads/xq-urlbandit.zip


WebRefs searches your webpages for hyperlinks and references to images
and scripts, then displays them in a list so you can see where a page
links to. You can also compare the links and references of two files.
http://www.sover.net/~whoi/Internet4.html
http://members.xoom.virgilio.it/thegolino/programmi/internet/prog54/webrefs41.zip
 
http://members.xoom.virgilio.it/thegolino/programmi/internet/prog54/webrefs41.zip

I haven't used it for awhile and don't even know if there's still a freeware
version, but webferret used to be a good little program for doing lots of
research on a given topic from several servers at one time. BUT, while you
could save the search...I never found a way to extract the url's from it.
Finding some program that would extract those url's would be nice. That's
what I mean when I say 'extract urls'. I still have the freeware version of
webferret but I'm not sure it works anymore. I really did like it. It was
fast and gave pages and pages and pages (depending on how many pages you set
it up to and how many search engines you selected).

Helen
 
Helen wrote: BillR wrote: (e-mail address removed) (BillR) wrote in message
Bert Hibberd
I'm looking for a
freeware program that will extract all the hyperlinks from any given page.
<Snip>

While at Son of Spy's site, I incidentally ran across LinkChaser. While
looking for URLs Out!, I also found URL Bandit (Xteq) and WebRefs. I
suspect that there are other programs that might meet your needs listed as
well: I only looked at part of two pages.

BillR

<snipped> . I've seen a few apps do this, but not exactly in the same way.
Needs VB5 runtimes AND MessageHook .dll available HERE. (123Kb) Download
140Kb http://www.sover.net/~whoi/Internet2.html

ftp://ftp.iif.hu/.ftp1/ftp.simtel.net/pub/simtelnet/win95/htmlmisc/linkchsr.zip


URL Bandit (Xteq) URL Bandit is a handy little program that grabs Internet
and email addresses from text sent to the Windows Clipboard. The rather
basic freeware system doesn't offer any bells and whistles, but could be
useful for gathering URLs for future rather than immediate use while
scanning links pages, ezines, newsletters, messages, and other materials. It
analyzes text sent to the Clipboard; recognizes Web, FTP, and email
addresses; extracts them; and adds >>>> them to a list for future reference
and use. Simply click on an entry to launch your default browser or email
program, or right-click to delete the item. Your lists are automatically
retained between sessions. Download 439Kb
http://www.sover.net/~whoi/Internet4.html
http://www.xteq.com/downloads/xq-urlbandit.zip


WebRefs searches your webpages for hyperlinks and references to images and
scripts, then displays them in a list so you can see where a page links to.
You can also compare the links and references of two files.
http://www.sover.net/~whoi/Internet4.html

http://members.xoom.virgilio.it/thegolino/programmi/internet/prog54/webrefs41.zip

I haven't used it for awhile and don't even know if there's still a freeware
version, but webferret used to be a good little program for doing lots of
research on a given topic from several servers at one time. BUT, while you
could save the search...I never found a way to extract the url's from it.
Finding some program that would extract those url's would be nice. That's
what I mean when I say 'extract urls'. I still have the freeware version of
webferret but I'm not sure it works anymore. I really did like it. It was
fast and gave pages and pages and pages (depending on how many pages you set
it up to and how many search engines you selected).

Helen


---
Outgoing mail is certified Virus Free. Checked by AVG anti-virus system
(http://www.grisoft.com). Version: 6.0.512 / Virus Database: 309 - Release
Date: 8/19/03

This email was cleaned by emailStripper, available for free from
http://www.papercut.biz/emailStripper.htm

I just checked and while the ZNet lists version 4 of Webferret as free, I
downloaded it but could not get it to run. I know it used to be a neat
program, but that was THEN, and this is NOW. The thing downloaded and
installed (small) but EVERY connection to every search engine was denied
connection... it may have been something with my proxy settings... but
anyway, for now, I'll just say it used to be a great little program.

Helen
 
Helen said:
http://members.xoom.virgilio.it/thegolino/programmi/internet/prog54/webrefs41.zip

I haven't used it for awhile and don't even know if there's still a freeware
version, but webferret used to be a good little program for doing lots of
research on a given topic from several servers at one time. BUT, while you
could save the search...I never found a way to extract the url's from it.
Finding some program that would extract those url's would be nice. That's
what I mean when I say 'extract urls'. I still have the freeware version of
webferret but I'm not sure it works anymore. I really did like it. It was
fast and gave pages and pages and pages (depending on how many pages you set
it up to and how many search engines you selected).

JetLinks will extract links from an HTML file or text file and turn them
into bookmarks.

<quote from the help file>
imports the URLs name or description, if the HTML source file contains
the following syntax:

<a href="...URL...">URL's name or description</a>
</quote>

About 99 percent accurate in my somewhat limited experience - a few
types of links are a problem. For instance, it doesn't like # in a URL .
..

http://www.manfred-dahlhoff.de/JetLinks

http://www.manfred-dahlhoff.de/download/jlsetup.exe
JetLinks 1.2 complete installation
(2555 KB)(4 December 2000)

Version 1.2.0.5 of JetLinks. Includes english and
german program languages


Susan
 
Thanks Tra@cie. I'll go see what I can find there. Since I have OE6 I
wouldn't have necessarily been looking at anything under OE5.5. BTW do
you know if THIS will work with OE6. In the Web Accessories part of your
post the note states that Web Accessories will not work on IE6. I wonder
if that applies to OE6 too? Probably. Also if any of these IE5 accessories
will work with OE6? I only use OE and not IE except insofar as the latter
is interconnected to the former. By that I mean that while IE is on the
machine, I never access it. If I could use OE without IE being on the
machine I would. I use OE everyday for e-mail and ngs. Thanks for your
time and response.

Helen
 
Tr@cie said:
then Web Accessories from Microsoft
Internet Explorer 5 Web Accessories
download link goes to
http://www.microsoft.com/windows/ie/previous/webaccess/ie5wa.asp

which has the link for "ie5wa.exe" 134 KB file


The direct link is noted in the Pricelessware description:

http://www.pricelessware.org/2003/PL2003SHELL.htm#I109

<quote>
Author/Home Page: Microsoft
http://www.microsoft.com/windows/ie/previous/webaccess/ie5wa.asp
Download: v (134 KB)
http://download.microsoft.com/download/ie5/Utility/1/W98NT42KMeXP/EN-US/ie5wa.exe
</quote>

Susan
 
Susan said:
JetLinks will extract links from an HTML file or text file and turn them
into bookmarks.
<quote from the help file>
imports the URLs name or description, if the HTML source file contains
the following syntax:
<a href="...URL...">URL's name or description</a>
</quote>
About 99 percent accurate in my somewhat limited experience - a few
types of links are a problem. For instance, it doesn't like # in a URL .
.

Hmm. What does it do if it encounters that kind of link? Hose it?
Ignore it?

Take it to dinner and a movie, and try to get to know it better? :)
 
<Snip>

You could also use Xenu's Link Sleuth (pricelessware) and its ilk for
that matter to extract valid URLs by limiting the depth and type of
links.

BillR
 
Blinky said:
Hmm. What does it do if it encounters that kind of link? Hose it?
Ignore it?

Take it to dinner and a movie, and try to get to know it better? :)


I *know* you have JetLinks too . . .
lazy, slothful shark . . .
making me do the work . . .

I just turned on the "monitor the clipboard option" and copied:

http://www.pricelessware.org/2003/PL2003BUSINESS.htm#Calculator

JetLinks popped up with: should I add

http://www.pricelessware.org/2003/

note: this is a local link - the coding is:

<A HREF="PL2003BUSINESS.htm#Calculator" >Calculator </A>

don't remember the other problem URLs I've encountered - but AFAIK it
imports "normal" URLs flawlessly.

Susan
 
I'm looking for a freeware program that will extract all the
hyperlinks from any given page.

I've tried to search through Google, but can't locate anything. Yet
I'm sure that something does exist.

Can anyone help!

WebTools v2.1
<Quote>
The Webtools is a must for any HTML coder or web designer. Ever wanted to
instantly see all the forms, images or styles on a page? Ever wanted to see
how the tables are laid out on your favorite site? Well now you can, and
you can also see them on any page on the net! Webtools manipulates the HTML
that you view, providing reports and that hard to find information that
will make you wonder why you ever 'Viewed Source'.
<snip></snip>

The Webtools have the following functions:
Page Info:
Parsed HTML
Selected HTML
List Images
List Links
List Script Blocks
List Frames
Document Properties
List forms
List Stylesheets
Document Tree
Watch
Navigation:
Go to Selected URL
Up a directory
Change Page:
Resize 640 x 480
Resize 800 x 600
Resize 1024 x 768
Hide Images
Text-Only
Image borders
Cell borders
Table borders
Div borders
Expand All
Show Hidden
Edit Frames
Find:
Mark All
Web Search Selected
Whois Lookup
Netcraft Lookup
</Quote>

I tried it and it works
**** Warning **** AD-ware!!!!
176k Zipped
http://www.iconico.com/webtool/
A couple of other free things there also.
 
Thanks everyone for your suggestions. I haven't quite found what I'm
looking for.

I am redesigning my web site with a totally new look and I wanted to
find a simpler way than retyping each site and its link.

So far the simplest way is to cut and copy, but that entails all sorts
other commands that Frontpage has put with the link and which I
defintely do not want in my new html script.

Thanks!

Bert Hibberd
Australia
 
Blinky said:
Susan said:
Blinky the Shark wrote:
I *know* you have JetLinks too . . .
But...

lazy, slothful shark . . .

...that aside...
making me do the work . . .

...did *you* have to go wait while another computer[1] took ages to boot
Windows, just to check out this one thing? As for me, I'm thinking,
"Noooooooo, you didn't." ;)

[1] An old laptop with a 400MH CPU running at the speed of dark.
I just turned on the "monitor the clipboard option" and copied:

JetLinks popped up with: should I add

note: this is a local link - the coding is:
<A HREF="PL2003BUSINESS.htm#Calculator" >Calculator </A>

Well, that *isn't* very helpful of it, is it? :)


It's interesting to note that it got the directories but not the file
name.
Is it okay with redirection?

dunno :)

Since I couldn't con - er - make that *shame* you into pinning the
import thing down - I did some work myself . . .

There are two problem symbols: # and %

If those occur in a URL Jetlinks will not recognize the URL during an
HTML file import.

It will recognize the URL during a text file import but will truncate
the URL at the offending symbol.

OTOH you can paste that type of URL into Jetlinks and export it to a NS
4.78 bookmarks file and reimport it without problems. IOW import/export
seems to work fine in NS. Don't know about IE or Opera . . .

note: I tried importing the *same* NS bookmark file (with the problem
bookmarks) as an HTML file. Jetlinks could not find any bookmarks.

EXAMPLES: Here are 8 *problem* URLs that I saved in a file.

<a
href="http://download.cnet.com/downloads/...ag=st.dl.10000.sbsr&qt=telephony&cn=&ca=10000">zzz
problem URL zzz</a>

<a
href="http://www.ecircles.com/magic/produ.../all.cgi?dont=cache&DebugMSG=userhasnocookies">zzz
problem URL zzz</a>

<a
href="http://www.godaddy.com/gdshop/default.asp?e=com&&location=www.google.com/search">zzz
problem URL zzz</a>

<a
href="http://www.hotwire.com/index.jsp?hwinimitable=Sat+Sep+23+07:00:16+PDT+2000">zzz
problem URL zzz</a>

<a href="http://www.pbm.com/~lindahl/food.html">zzz problem URL
zzz</a>

<a
href="http://www.veloz.com/features/?from=cart&pg=&faq_trial&ver=online">zzz
problem URL zzz</a>

<a href="http://freeware95.atlnet.com/CATEGORIES...#149">zzz problem URL
zzz</a>

<a href="http://members.aol.com/axcel216/newtip98.htm#HOME">zzz problem
URL zzz</a>


None of these would import as HTML. When I imported as text - from the
file shown above - they were all imported incorrectly.

The list below shows the *original* URL followed by the *imported* URL:

1
http://download.cnet.com/downloads/...ag=st.dl.10000.sbsr&qt=telephony&cn=&ca=10000

http://download.cnet.com/downloads/1,10150,0-10000-103-0-1-7,00.html?tag=st

2
http://www.ecircles.com/magic/produ.../all.cgi?dont=cache&DebugMSG=userhasnocookies

http://www.ecircles.com/magic/products/gate/login.cgi?nextURL=/magic/products/circles/all.cgi

3
http://www.godaddy.com/gdshop/default.asp?e=com&&location=www.google.com/search

http://www.godaddy.com/gdshop/default.asp?e=com&&location=www

4
http://www.hotwire.com/index.jsp?hwinimitable=Sat+Sep+23+07:00:16+PDT+2000
http://www.hotwire.com/index.jsp?hwinimitable=Sat+Sep+23+07

5
http://www.pbm.com/~lindahl/food.html
http://www.pbm.com/

6
http://www.veloz.com/features/?from=cart&pg=&faq_trial&ver=online
http://www.veloz.com/features/?from=cart&pg=

7
http://freeware95.atlnet.com/CATEGORIES...#149
http://freeware95.atlnet.com/CATEGORIES...

8
http://members.aol.com/axcel216/newtip98.htm#HOME
http://members.aol.com/axcel216/newtip98.htm


Susan
 
Susan said:
Blinky the Shark wrote:
Susan Bugher wrote:
Blinky the Shark wrote:
Susan Bugher wrote:
JetLinks will extract links from an HTML file or text file and turn them
into bookmarks.
<quote from the help file>
imports the URLs name or description, if the HTML source file contains
the following syntax:
<a href="...URL...">URL's name or description</a>
</quote>
About 99 percent accurate in my somewhat limited experience - a few
types of links are a problem. For instance, it doesn't like # in a URL .
.
Hmm. What does it do if it encounters that kind of link? Hose it?
Ignore it?
Take it to dinner and a movie, and try to get to know it better? :)
I *know* you have JetLinks too . . .
But...
lazy, slothful shark . . .
...that aside...
making me do the work . . .
...did *you* have to go wait while another computer[1] took ages to boot
Windows, just to check out this one thing? As for me, I'm thinking,
"Noooooooo, you didn't." ;)
[1] An old laptop with a 400MH CPU running at the speed of dark.
I just turned on the "monitor the clipboard option" and copied:
http://www.pricelessware.org/2003/PL2003BUSINESS.htm#Calculator
JetLinks popped up with: should I add
http://www.pricelessware.org/2003/
note: this is a local link - the coding is:
<A HREF="PL2003BUSINESS.htm#Calculator" >Calculator </A>
Well, that *isn't* very helpful of it, is it? :)
It's interesting to note that it got the directories but not the file
name.

Yeah, it is.
Since I couldn't con - er - make that *shame* you into pinning the
import thing down - I did some work myself . . .

Not when I'm not even running Windows (as I mentioned). You port it
to Linux and I'll check on these things. Otherwise, you're going to
have to do some superdoublebig industrial strength shaming. :)
There are two problem symbols: # and %
If those occur in a URL Jetlinks will not recognize the URL during an
HTML file import.
It will recognize the URL during a text file import but will truncate
the URL at the offending symbol.

That be berry, berry bad.
OTOH you can paste that type of URL into Jetlinks and export it to a NS
4.78 bookmarks file and reimport it without problems. IOW import/export
seems to work fine in NS. Don't know about IE or Opera . . .

Sounds like a Civil Service make-work program, to me.
note: I tried importing the *same* NS bookmark file (with the problem
bookmarks) as an HTML file. Jetlinks could not find any bookmarks.
EXAMPLES: Here are 8 *problem* URLs that I saved in a file.
<a href="http://www.pbm.com/~lindahl/food.html">zzz problem URL
zzz</a>
None of these would import as HTML. When I imported as text - from the
file shown above - they were all imported incorrectly.
The list below shows the *original* URL followed by the *imported* URL:

http://download.cnet.com/downloads/1,10150,0-10000-103-0-1-7,00.html?tag=st

It did buy the question mark, though.

Ditto.

And I see more examples of this below, which I will leave
uncommented, but...

....taking this example, just because it's short (and hoping your patience
isn't running short - but *this* should be a quickie) -- what happens
(with HTML or/and text imports) if you add another % to the one that
naturally exists? (Yes, there's a method to this...) IOW, try this:

http://www.pbm.com/%~lindahl/food.html
 
Back
Top