How do we save linked webpages?

  • Thread starter Thread starter Blue Max
  • Start date Start date
B

Blue Max

Some companies provide product manuals or guides in HTML format on their
webpage versus .PDF format. The .PDF manual is easy to download as a
reference. However, what if the manual is in HTML format? Is there a way
to download the webpage for the manual, including all the webpages linked to
the table of contents on the main page that comprise the manual?
 
Please state your IE version and full Windows version (e.g., WinXP SP3;
Vista SP1) when posting to this newsgroup.
 
Hello Robear,

As we posted to the Windows Vista forum, you probably guessed that we are
using Vista. Furthermore, we are using Vista Utlimate (SP1) to be more
precise. We are also using Internet Explorer 7 as a browser.

Basically, we want to get the full HTML version of the manual onto our
computer for future reference, just as we would a .PDF manual, provided that
is possible. To clarify, we do not want just the URL links on our computer,
but the full-fledged HTML pages. Furthermore, we would be interested in
knowing if we could convert an HTML manual to a .PDF, for example.

Thanks,

Richard

*************
 
[Crossposted to IE General newsgroup]

The you or the end-user could simply save the page via IE: File | Save as |
Save as type | webpage, complete (*.htm; *.html).

There are many third-party applications available which can be used to
convert an HTML file to a PDF file (e.g., Adobe Acrobat; Foxit software).
There's even a "Save as PDF" Add-in for Office 2007.

NB: You did not post to a Windows Vista forum, you posted to an IE6-specific
newsgroup. Those with IE7-specific questions or comments are asked to post
to and seek support in this newsgroup:
microsoft.public.internetexplorer.general

On the web:
http://www.microsoft.com/communitie...?dg=microsoft.public.internetexplorer.general

In Windows Mail newsreader:
news://msnews.microsoft.com/microsoft.public.internetexplorer.general
 
Thank you, Robear, your reply was most helpful. Sorry about the posting, I
was initially unable to find anything other than the Internet Explorer 6.0
newsgroup, so am glad you were roving around. The newsgroup names are not
always consistent from version to version which sometimes makes it hard to
find the proper group. I also thought I had concurrently posted to Vista
also, but it appears I had not, sorry.

Thank you for your helpful information. From what you have said, we will
assume that using the "Save As" webpage complete will save the desired
webpage and all pages linked to that page. Just a few more question if we
may:

FIRST, what about webpages linked to the linked webpages? Does this feature
only save the links one level deep (i.e., linked to the first page) or does
it save the full network of webpages linked to the first webpage.

SECOND, what is the nature of saving the webpage as a "Web Archive, single
file (*.mht)? Will that also save the webpage and all linked webpages, but
as one file? If so, this may be a nice alternative.

THIRD, can the saved HTML be imported into Microsoft Word 2007 and then
saved as a .PDF using the addin for Word? If not, I do have .PDF printers
that should print the webpage as a .PDF, but they will not print the linked
pages at the same time, as far as I know.

Thank you for your assistance!

Richard

***************
PA Bear said:
[Crossposted to IE General newsgroup]

The you or the end-user could simply save the page via IE: File | Save as
| Save as type | webpage, complete (*.htm; *.html).

There are many third-party applications available which can be used to
convert an HTML file to a PDF file (e.g., Adobe Acrobat; Foxit software).
There's even a "Save as PDF" Add-in for Office 2007.

NB: You did not post to a Windows Vista forum, you posted to an
IE6-specific newsgroup. Those with IE7-specific questions or comments are
asked to post to and seek support in this newsgroup:
microsoft.public.internetexplorer.general

On the web:
http://www.microsoft.com/communitie...?dg=microsoft.public.internetexplorer.general

In Windows Mail newsreader:
news://msnews.microsoft.com/microsoft.public.internetexplorer.general
--
~PA Bear

Blue said:
Hello Robear,

As we posted to the Windows Vista forum, you probably guessed that we are
using Vista. Furthermore, we are using Vista Utlimate (SP1) to be more
precise. We are also using Internet Explorer 7 as a browser.

Basically, we want to get the full HTML version of the manual onto our
computer for future reference, just as we would a .PDF manual, provided
that
is possible. To clarify, we do not want just the URL links on our
computer,
but the full-fledged HTML pages. Furthermore, we would be interested in
knowing if we could convert an HTML manual to a .PDF, for example.
 
Just an update Robear,

After some experimentation, it appears that the option you suggest only
saves the active webpage and not any of the linked webpages. I tested this
by saving the complete webpage, disabling my internet connection, and then
opening the saved webpage and trying to access the webpage links.
Unfortunately, the linked webpages are not available, only the webpage we
saved.

As such, we are still facing the same dilemma as to how to save a webpage,
and all its linked webpages, in the same operation. Individually saving
each webpage, is always tedious and sometimes almost impossible.

For example, product sites often have a webpage for FAQ's, but each question
is a link to a separate webpage that addresses each FAQ. Another example,
are online product manuals that have a 'Table of Contents' webpage that
links the user to each chapter which, like the FAQ's, are separate linked
webpages.

Of course, our original question centered around this key issue. Any ideas
on how how we can download these .HTML reference resources in a single
download?

Thanks again,

Richard

***************
PA Bear said:
[Crossposted to IE General newsgroup]

The you or the end-user could simply save the page via IE: File | Save as
| Save as type | webpage, complete (*.htm; *.html).

There are many third-party applications available which can be used to
convert an HTML file to a PDF file (e.g., Adobe Acrobat; Foxit software).
There's even a "Save as PDF" Add-in for Office 2007.

NB: You did not post to a Windows Vista forum, you posted to an
IE6-specific newsgroup. Those with IE7-specific questions or comments are
asked to post to and seek support in this newsgroup:
microsoft.public.internetexplorer.general

On the web:
http://www.microsoft.com/communitie...?dg=microsoft.public.internetexplorer.general

In Windows Mail newsreader:
news://msnews.microsoft.com/microsoft.public.internetexplorer.general
--
~PA Bear

Blue said:
Hello Robear,

As we posted to the Windows Vista forum, you probably guessed that we are
using Vista. Furthermore, we are using Vista Utlimate (SP1) to be more
precise. We are also using Internet Explorer 7 as a browser.

Basically, we want to get the full HTML version of the manual onto our
computer for future reference, just as we would a .PDF manual, provided
that
is possible. To clarify, we do not want just the URL links on our
computer,
but the full-fledged HTML pages. Furthermore, we would be interested in
knowing if we could convert an HTML manual to a .PDF, for example.
 
I think you need to hire someone with experience in this field, Max.

Blue said:
Just an update Robear,

After some experimentation, it appears that the option you suggest only
saves the active webpage and not any of the linked webpages. I tested
this
by saving the complete webpage, disabling my internet connection, and then
opening the saved webpage and trying to access the webpage links.
Unfortunately, the linked webpages are not available, only the webpage we
saved.

As such, we are still facing the same dilemma as to how to save a webpage,
and all its linked webpages, in the same operation. Individually saving
each webpage, is always tedious and sometimes almost impossible.

For example, product sites often have a webpage for FAQ's, but each
question
is a link to a separate webpage that addresses each FAQ. Another example,
are online product manuals that have a 'Table of Contents' webpage that
links the user to each chapter which, like the FAQ's, are separate linked
webpages.

Of course, our original question centered around this key issue. Any
ideas
on how how we can download these .HTML reference resources in a single
download?

Thanks again,

Richard

***************
PA Bear said:
[Crossposted to IE General newsgroup]

The you or the end-user could simply save the page via IE: File | Save as
Save as type | webpage, complete (*.htm; *.html).

There are many third-party applications available which can be used to
convert an HTML file to a PDF file (e.g., Adobe Acrobat; Foxit software).
There's even a "Save as PDF" Add-in for Office 2007.

NB: You did not post to a Windows Vista forum, you posted to an
IE6-specific newsgroup. Those with IE7-specific questions or comments
are
asked to post to and seek support in this newsgroup:
microsoft.public.internetexplorer.general

On the web:
http://www.microsoft.com/communitie...?dg=microsoft.public.internetexplorer.general

In Windows Mail newsreader:
news://msnews.microsoft.com/microsoft.public.internetexplorer.general
--
~PA Bear

Blue said:
Hello Robear,

As we posted to the Windows Vista forum, you probably guessed that we
are
using Vista. Furthermore, we are using Vista Utlimate (SP1) to be more
precise. We are also using Internet Explorer 7 as a browser.

Basically, we want to get the full HTML version of the manual onto our
computer for future reference, just as we would a .PDF manual, provided
that
is possible. To clarify, we do not want just the URL links on our
computer,
but the full-fledged HTML pages. Furthermore, we would be interested in
knowing if we could convert an HTML manual to a .PDF, for example.

Please state your IE version and full Windows version (e.g., WinXP SP3;
Vista SP1) when posting to this newsgroup.
--
~Robear Dyer (PA Bear)
MS MVP-IE, Mail, Security, Windows Desktop Experience - since 2002
AumHa VSOP & Admin http://aumha.net
DTS-L http://dts-l.net/

Blue Max wrote:
Some companies provide product manuals or guides in HTML format on
their
webpage versus .PDF format. The .PDF manual is easy to download as a
reference. However, what if the manual is in HTML format? Is there a
way
to download the webpage for the manual, including all the webpages
linked
to
the table of contents on the main page that comprise the manual?
 
Hello PA Bear,

A little research is showing that others face the same problem. Some
possible tools include the Microsoft Internet Explorer 'WebRipper' add-in,
the WinHTTrack freeware program, and others for Firefox also. The
WinHTTrack program looks like it will work, but is very complex. I would
like to try the Microsoft add-in, but cannot get it to load properly yet in
Internet Explorer 8 (Beta).

Thanks for your help.

****************
PA Bear said:
I think you need to hire someone with experience in this field, Max.

Blue said:
Just an update Robear,

After some experimentation, it appears that the option you suggest only
saves the active webpage and not any of the linked webpages. I tested
this
by saving the complete webpage, disabling my internet connection, and
then
opening the saved webpage and trying to access the webpage links.
Unfortunately, the linked webpages are not available, only the webpage we
saved.

As such, we are still facing the same dilemma as to how to save a
webpage,
and all its linked webpages, in the same operation. Individually saving
each webpage, is always tedious and sometimes almost impossible.

For example, product sites often have a webpage for FAQ's, but each
question
is a link to a separate webpage that addresses each FAQ. Another
example,
are online product manuals that have a 'Table of Contents' webpage that
links the user to each chapter which, like the FAQ's, are separate linked
webpages.

Of course, our original question centered around this key issue. Any
ideas
on how how we can download these .HTML reference resources in a single
download?

Thanks again,

Richard

***************
PA Bear said:
[Crossposted to IE General newsgroup]

The you or the end-user could simply save the page via IE: File | Save
as
Save as type | webpage, complete (*.htm; *.html).

There are many third-party applications available which can be used to
convert an HTML file to a PDF file (e.g., Adobe Acrobat; Foxit
software).
There's even a "Save as PDF" Add-in for Office 2007.

NB: You did not post to a Windows Vista forum, you posted to an
IE6-specific newsgroup. Those with IE7-specific questions or comments
are
asked to post to and seek support in this newsgroup:
microsoft.public.internetexplorer.general

On the web:
http://www.microsoft.com/communitie...?dg=microsoft.public.internetexplorer.general

In Windows Mail newsreader:
news://msnews.microsoft.com/microsoft.public.internetexplorer.general
--
~PA Bear

Blue Max wrote:
Hello Robear,

As we posted to the Windows Vista forum, you probably guessed that we
are
using Vista. Furthermore, we are using Vista Utlimate (SP1) to be more
precise. We are also using Internet Explorer 7 as a browser.

Basically, we want to get the full HTML version of the manual onto our
computer for future reference, just as we would a .PDF manual, provided
that
is possible. To clarify, we do not want just the URL links on our
computer,
but the full-fledged HTML pages. Furthermore, we would be interested
in
knowing if we could convert an HTML manual to a .PDF, for example.

Please state your IE version and full Windows version (e.g., WinXP
SP3;
Vista SP1) when posting to this newsgroup.
--
~Robear Dyer (PA Bear)
MS MVP-IE, Mail, Security, Windows Desktop Experience - since 2002
AumHa VSOP & Admin http://aumha.net
DTS-L http://dts-l.net/

Blue Max wrote:
Some companies provide product manuals or guides in HTML format on
their
webpage versus .PDF format. The .PDF manual is easy to download as a
reference. However, what if the manual is in HTML format? Is there
a
way
to download the webpage for the manual, including all the webpages
linked
to
the table of contents on the main page that comprise the manual?
 
Thanks, Robear. I am certainly inclined to agree with you regarding the
Internet Explorer 8 Beta. I installed the Explorer 8 Beta with some
reservation, however some of the tools we wanted to test indicated they
required Explorer 8. An interesting note here, is that all the Microsoft
links for Internet Explorer add-ins appeared to take us to the I.E. 8
webpage? In consequence, we will roll back to Explorer 7 and wait for the
final release of 8.

As for the ripper software, it also looks to still be in its infancy; at
very least, a little more technical than we want to deal with at this time.
Nonetheless, thank you for the web link directly to Calluna, we will
continue to watch that product. Regardless, we really wish there was a
simple tool for accomplishing what we originally needed. I do think there
would be some universal good to a simple program that would download HTML
manuals, guides, FAQ's, and troubleshooting resources for customers owning
the product.

Thanks,

Richard

*******************
 
...I installed the Explorer 8 Beta with some
reservation, however some of the tools we wanted to test indicated they
required Explorer 8.

What tools?

They required IE8 or the tools threow an error telling you that you needed
to upgrade IE? (Big difference.)
 
Back
Top