How can I dump the entire Web DOM in its current s

2020-03-01 03:30发布

I want to dump the current DOM to a file and be able to view it offline. Essentially, I have an outdated version of a page that I would like to keep around for comparison. As soon as I close my browser, I'm going to lose it so I would like to save the DOM exactly as it is.

There is already an answer for doing this in Firefox but how do I do it in Chrome?

3条回答
混吃等死
2楼-- · 2020-03-01 03:39

I am currently using version 53.0.2785.113 m of Chrome. The other answers no longer appear to be valid. To properly copy all child/descendant elements the user now has to right click on <html> then click "Expand All" before copying. Other wise you will not recursively copy all elements. A normal Ctrl+C will copy everything one <html> has been expanded.

查看更多
姐就是有狂的资本
3楼-- · 2020-03-01 03:43

Using the Web Inspector (F12), go to the Elements tab, right click on the <html> tag in your code and select Copy->Copy outerHTML. Then paste that into a new file and save.

查看更多
孤傲高冷的网名
4楼-- · 2020-03-01 03:59

Command line solution

This is easy to do with newer releases of Chrome:

google-chrome --headless --dump-dom 'http://www.yahoo.com'

(The OP may not have been looking for a command line solution but this search result appears high when searching so others might find it useful)


Original answer 2017

My favorite way to do this is:

docker run -it --rm --name chrome --shm-size=1024m --cap-add=SYS_ADMIN --entrypoint=/usr/bin/google-chrome-unstable yukinying/chrome-headless-browser --headless --disable-gpu --dump-dom https://www.facebook.com

If you're not familiar with how Docker works, be patient - the first time will be slow but subsequent invocations will be quick.


Other information

Tested on

Ubuntu 16

Linux intel-nuc 4.4.0-21-generic #37-Ubuntu SMP Mon Apr 18 18:33:37 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux

Docker version:

Docker version 1.10.3, build 20f81dd

Mac OS X Sierra

Darwin MacBook-Pro.local 16.7.0 Darwin Kernel Version 16.7.0: Thu Jun 15 17:36:27 PDT 2017; root:xnu-3789.70.16~2/RELEASE_X86_64 x86_64 i386 MacBookPro14,3 Darwin

Docker version:

Docker version 17.06.1-ce, build 874a737

If you install tidy you can indent the HTML too.

查看更多
登录 后发表回答