Creating a static archive of drupal site ("base" directive gets in the way)
I would like to create a static html archive of my drupal site (on a regular basis, for corporate compliance reasons related to documenting previous states of the publicly published information). I have referred to the drupal manual page at http://drupal.org/node/ 27882. But I have two problems (because I am simply archiving a snapshot of the site, not retiring it, so I don't want to interfere with the normal, drupal-based, dynamic operation). 1. When I use any of the download tools (wget, sitesucker, I've not yet tried httrack) to download the site, the function drupal_get_html_head in file includes/common.inc outputs a directive ". This causes the downloaded pages to contain links which try to open the original site on the Internet, rather than the local file copy. 2. I would like to be logged in as a special user, named "archive", which is configured specially for archive purposes. For instance, it has no permission to search; it displays a custom block that tells that this is an archived version of the website, and states the date on which it was archived, etc. This works for me when I manually login as user "archive". How to get a downloading tool to login as a drupal user (they are good at http authentication, but have no understanding of drupal authentication). Thanks, Cheryl
Unreg me please flyg@kolbro.se kolbro@telia.com _____ Från: support-bounces@drupal.org [mailto:support-bounces@drupal.org] För Cheryl Chase Skickat: den 17 juli 2007 21:11 Till: support@drupal.org Ämne: [support] Creating a static archive of drupal site ("base"directive gets in the way) I would like to create a static html archive of my drupal site (on a regular basis, for corporate compliance reasons related to documenting previous states of the publicly published information). I have referred to the drupal manual page at <http://drupal.org/node/27882> http://drupal.org/node/27882. But I have two problems (because I am simply archiving a snapshot of the site, not retiring it, so I don't want to interfere with the normal, drupal-based, dynamic operation). 1. When I use any of the download tools (wget, sitesucker, I've not yet tried httrack) to download the site, the function drupal_get_html_head in file includes/common.inc outputs a directive ". This causes the downloaded pages to contain links which try to open the original site on the Internet, rather than the local file copy. 2. I would like to be logged in as a special user, named "archive", which is configured specially for archive purposes. For instance, it has no permission to search; it displays a custom block that tells that this is an archived version of the website, and states the date on which it was archived, etc. This works for me when I manually login as user "archive". How to get a downloading tool to login as a drupal user (they are good at http authentication, but have no understanding of drupal authentication). Thanks, Cheryl
On Tuesday 17 July 2007 12:23:00 Ulf Kölbro wrote:
Unreg me please
You can unreg yourself here: http://lists.drupal.org/listinfo/support -- Jason Flatt http://www.oadaeh.net/ Father of Six: http://www.flattfamily.com/ (Joseph, 14; Cramer, 12; Travis, 10; Angela; Harry, 7; and William, 12:04 am, 12-29-2005) Linux User: http://www.xubuntu.org/ Drupal Fanatic: http://drupal.org/
Ulf Kölbro wrote:
Unreg me please
To unsubscribe from the list, click on the link at the bottom of every message: [ Drupal support list | http://lists.drupal.org/ ] Find the list you want to unsubscribe from, and click on its link. At the bottom of the page you'll see an 'unsubscribe' button. Enter your email address there, click that and complete the process by following the prompts.
On Tuesday 17 July 2007, Cheryl Chase wrote:
I would like to create a static html archive of my drupal site (on a regular basis, for corporate compliance reasons related to documenting previous states of the publicly published information).
I have referred to the drupal manual page at http://drupal.org/node/ 27882. But I have two problems (because I am simply archiving a snapshot of the site, not retiring it, so I don't want to interfere with the normal, drupal-based, dynamic operation).
1. When I use any of the download tools (wget, sitesucker, I've not yet tried httrack) to download the site, the function drupal_get_html_head in file includes/common.inc outputs a directive ". This causes the downloaded pages to contain links which try to open the original site on the Internet, rather than the local file copy.
I'm afraid the directive didn't come through. Which one do you mean? :-)
2. I would like to be logged in as a special user, named "archive", which is configured specially for archive purposes. For instance, it has no permission to search; it displays a custom block that tells that this is an archived version of the website, and states the date on which it was archived, etc. This works for me when I manually login as user "archive". How to get a downloading tool to login as a drupal user (they are good at http authentication, but have no understanding of drupal authentication).
If you log in as that user, you can check in the database and see what the session variable is for that user. wget (and probably the others) can be set to send a specific cookie with each request, and you can just give it that value. See the man page for the exact syntax, as I don't recall it at the moment. -- Larry Garfield AIM: LOLG42 larry@garfieldtech.com ICQ: 6817012 "If nature has made any one thing less susceptible than all others of exclusive property, it is the action of the thinking power called an idea, which an individual may exclusively possess as long as he keeps it to himself; but the moment it is divulged, it forces itself into the possession of every one, and the receiver cannot dispossess himself of it." -- Thomas Jefferson
Quoting Cheryl Chase <cheryl@mathiasconsulting.com>:
I would like to create a static html archive of my drupal site (on a regular basis, for corporate compliance reasons related to documenting previous states of the publicly published information).
Would node revisions be sufficient for that? See http://groups.drupal.org/taxonomy/term/1032/ for reference. Earnie -- http://for-my-kids.com/
participants (6)
-
Cheryl Chase -
Earl Miles -
Earnie Boyd -
Jason Flatt -
Larry Garfield -
Ulf Kölbro