Display URL in saved html files

  • Thread starter Thread starter Pete
  • Start date Start date
P

Pete

Hi there!

I save a lot of html files from the websites while doing my research.
Sometimes I need to revisit the sites where I got the html files but I
can't find out the URL from the saved html file.

Is there some software out there that can read from the saved html
file and tell me the URL?

Thanks.

Pete
 
Pete said:
I save a lot of html files from the websites while doing my research.
Sometimes I need to revisit the sites where I got the html files but I
can't find out the URL from the saved html file.

Is there some software out there that can read from the saved html
file and tell me the URL?
Mozilla (1.5) does keep the history of all your saved files (pdf, ... and
even html). But the file itself normaly does not contain informations
about its origin.

Bernd
 
| I save a lot of html files from the websites while doing my research.
| Sometimes I need to revisit the sites where I got the html files
| but I can't find out the URL from the saved html file.
|
| Is there some software out there that can read from the saved html
| file and tell me the URL?

Pete ....

Internet Explorer includes the saved from url
in an HTML comment string when using ....

File .... Save As ....

<!-- saved from url=(0031)http://www.onlamp.com/lpt/a/923 -->

I don't know of a program that looks for and uses these,
but you can find them using an editor or View .... Source
from the browser ....
 
Cousin said:
Internet Explorer includes the saved from url
in an HTML comment string when using ....

File .... Save As ....

<!-- saved from url=(0031)http://www.onlamp.com/lpt/a/923 -->

Not the case here. IE6, using Save As/html only.

Opera has a nice Note feature. The url is displayed in a
bubble-up help balloon, but not in the note's text. It can be
fetched from Opera's Note file, however.

p.
 
pb said:
Cousin Stanley wrote:




Not the case here. IE6, using Save As/html only.

Opera has a nice Note feature. The url is displayed in a
bubble-up help balloon, but not in the note's text. It can be
fetched from Opera's Note file, however.

p.

URL Bandit (Xteq) URL Bandit is a handy little program that grabs Internet
and email addresses from text sent to the Windows Clipboard. The rather
basic freeware system doesn't offer any bells and whistles, but could be
useful for gathering URLs for future rather than immediate use while
scanning links pages, ezines, newsletters, messages, and other materials.
It analyzes text sent to the Clipboard; recognizes Web, FTP, and email
addresses; extracts them; and adds them to a list for future reference and
use. Simply click on an entry to launch your default browser or email
program, or right-click to delete the item. Your lists are automatically
retained between sessions. ~439 on my Internet4 page or HERE:
http://www.xteq.com/downloads/xq-urlbandit.zip

Worth a try...

Son Of Spy
--

Read the latest in the Scumsucking SPCA saga HERE:
http://www.sover.net/~wysiwygx/SPCAScum.html

http://www.sover.net/~wysiwygx/index.html
. --- . . - - - - - - - - - - - -
/ SOS \ __ / Freeware - - - - - -
/ / \ ( ) / - - - - -
/ / / / / / / \/ \ - - - -
/ / / / / / / : : - - -
/ / / / / ' ' - -
/ / //..\\
=====UU==UU=====
'///||\\\'
' '' '
 
Pete said:
I save a lot of html files from the websites while doing my research.
Sometimes I need to revisit the sites where I got the html files but I
can't find out the URL from the saved html file.

If you use Proxomitron, try either of these two filters. I've used the
first one for YEARS very successfully. Someone else created the second
one...which is a little more modern (using "Start" instead of "Body") and
additionally adds the date and time. Try them both and see which works
for you the best.


Name = "<BODY> Inject URL 2
Active = TRUE
Multi = TRUE
Limit = 256
Match = "<body\1>"
Replace = "<body\1> PAGE URL:<a href="\u">\u</a><br>"

Name = "<BODY> Inject URL 2 (With Date and Start)"
Active = FALSE
Multi = TRUE
Limit = 256
Match = "<start>"
Replace = "PAGE URL: <a href="\u">\u</a><br>ACCESS: $DTM(Y/M/D h:m a)"
 
I use Proxomitron especially for its ability to put the URL and date/time of
access at the top of web pages when they are displayed and saved.

There is also a utility called InjectURL that will insert the URL -
http://www.tranglos.com/free/injecturl.html
- but I prefer to use Proxomitron with an "inject URL" filter.
 
Internet Explorer includes the saved from url
in an HTML comment string when using ....

File .... Save As ....

<!-- saved from url=(0031)http://www.onlamp.com/lpt/a/923 -->

| Not the case here. IE6, using Save As/html only.

pb ....

I also use IE6, so I'm guessing the difference
is probably that I use ... Save As ... Web Page, complete ....
 
Hi Jason!

If you use Proxomitron, try either of these two filters. I've used the
first one for YEARS very successfully. Someone else created the second
one...which is a little more modern (using "Start" instead of "Body") and
additionally adds the date and time. Try them both and see which works
for you the best.


Name = "<BODY> Inject URL 2
Active = TRUE
Multi = TRUE
Limit = 256
Match = "<body\1>"
Replace = "<body\1> PAGE URL:<a href="\u">\u</a><br>"

Name = "<BODY> Inject URL 2 (With Date and Start)"
Active = FALSE
Multi = TRUE
Limit = 256
Match = "<start>"
Replace = "PAGE URL: <a href="\u">\u</a><br>ACCESS: $DTM(Y/M/D h:m a)"

Thank you very much. Exactly what I was looking for. I've heard of
Proxomitron before but never knew that it can add the URL to the page.
I'm using the second filter that you've provided above. I also tried
InjectURL (mentioned by another poster), but unfortunately it only
works with IE and not Opera which I use as my default browser.

I'd also like to thank everyone who offered other suggestions.

Pete
 
Cousin said:
I also use IE6, so I'm guessing the difference
is probably that I use ... Save As ... Web Page, complete ....

Thinking about it, that would make a bit of sense.

p.
 
Pete said:
Hi there!

I save a lot of html files from the websites while doing my research.
Sometimes I need to revisit the sites where I got the html files but I
can't find out the URL from the saved html file.

Is there some software out there that can read from the saved html
file and tell me the URL?

Thanks.

Pete

If you download the files with WinHTTrack it can be configured to save the
URL (base site and file name) at the head of the "source" file. A typical
example:

<!-- Mirrored by: HTTrack Website Copier/3.x. Site: www.straightdope.com.
File: /columns/030314.html. Date: Thu, 09 Oct 2003 20:52:02 GMT -->

The URL for WinHTTrack is:
http://www.httrack.com

===

Frank Bohan
¶ The squeaky wheel gets the grease.
 
Back
Top