This is my extended question for the question:
How to read contents of an Table in MS-Word file Using Python?
The solution provided by @YusuMishi is great, but it does not catch the headers in the header and footer.
Let me elaborate on that:
Using the code
import win32com.client as win32
import os
word = win32.Dispatch("Word.Application")
word.Visible = 0
p = os.path.abspath("Catch my tables.docx")
word.Documents.Open(p)
doc = word.ActiveDocument
print doc.Tables.Count
I will get 2
printed (Table 1
and Table 2
)
How do I go through the information in Table 0
and Table N
Accessing Headers and Footers is a bit tricky. Here is how to do it:
You can get the table count this way:
And get the text from cells this way:
Unfortunately I'm not phyton programmer but have some knowledge of MS-Word VBA and objects hierarchy. This would be to much text to put in comment (where I'd rather put this tip).
If you search your table you have to analyse different
Document.StoryRanges
to find your table. There areFooters
andHeaders
but they are additionally divided into different types. So, to find you Table 0 you could use something of this structure:This is VBA code!! I hope you could adjust to your needs. And do it separately for you footers.