Finding text inside Word Docs without using Word 
Author Message
 Finding text inside Word Docs without using Word

Is there a way to find text inside Word documents, on a computer where Word
is not available (only WordView), and where i can't use Find (advanced) from
W95.

Is it possible to create a prog witch can do this??

Any suggestion are welcome....

Stephan

--

To send E-mail, remove the spamblock ".rotzooi"
Om E-mail te sturen, verwijder de rotzooi.



Sun, 22 Oct 2000 03:00:00 GMT  
 Finding text inside Word Docs without using Word

Get yourself a hex viewer and look at a Word document.  You will find that
text is stored simply as text.  Therefore, opening the Word document in
binary mode and using Instr works like a charm.

Quote:

>Is there a way to find text inside Word documents, on a computer where Word
>is not available (only WordView), and where i can't use Find (advanced)
from
>W95.

>Is it possible to create a prog witch can do this??

>Any suggestion are welcome....

>Stephan

>--

>To send E-mail, remove the spamblock ".rotzooi"
>Om E-mail te sturen, verwijder de rotzooi.



Sun, 22 Oct 2000 03:00:00 GMT  
 Finding text inside Word Docs without using Word

Quote:

>Get yourself a hex viewer and look at a Word document.  You will find that
>text is stored simply as text.  Therefore, opening the Word document in
>binary mode and using Instr works like a charm.

Well, yes and no. Yes, Word docs are usually saved in  binary format that
nonetheless allows text grepping, but often the structure (and text) is
screwy. So
you should have no trouble seeing if a given word appears *someplace* in the
doc, but large-scale parsing is a drag.

Plus, Word97 docs are often saved as RTF if the user wants it to be
readable in Word95. So,  you'll find lots of {\pard stuff.

I've been parsing Word docs using perl on a Unix box, running into all sorts
of fun.

James

Quote:

>>Is there a way to find text inside Word documents, on a computer where
Word
>>is not available (only WordView), and where i can't use Find (advanced)
>from
>>W95.

>>Is it possible to create a prog witch can do this??

>>Any suggestion are welcome....

>>Stephan

>>--

>>To send E-mail, remove the spamblock ".rotzooi"
>>Om E-mail te sturen, verwijder de rotzooi.



Sun, 22 Oct 2000 03:00:00 GMT  
 Finding text inside Word Docs without using Word

Where is the documentation for the .DOC format, or does M$ want a pile of
money for that information?

Anyone know?

Dan Moran

Quote:


>>Get yourself a hex viewer and look at a Word document.  You will find that
>>text is stored simply as text.  Therefore, opening the Word document in
>>binary mode and using Instr works like a charm.

>Well, yes and no. Yes, Word docs are usually saved in  binary format that
>nonetheless allows text grepping, but often the structure (and text) is
>screwy. So
>you should have no trouble seeing if a given word appears *someplace* in
the
>doc, but large-scale parsing is a drag.

>Plus, Word97 docs are often saved as RTF if the user wants it to be
>readable in Word95. So,  you'll find lots of {\pard stuff.

>I've been parsing Word docs using perl on a Unix box, running into all
sorts
>of fun.

>James


>>>Is there a way to find text inside Word documents, on a computer where
>Word
>>>is not available (only WordView), and where i can't use Find (advanced)
>>from
>>>W95.

>>>Is it possible to create a prog witch can do this??

>>>Any suggestion are welcome....

>>>Stephan

>>>--

>>>To send E-mail, remove the spamblock ".rotzooi"
>>>Om E-mail te sturen, verwijder de rotzooi.



Mon, 23 Oct 2000 03:00:00 GMT  
 Finding text inside Word Docs without using Word

From Dr GUI's column this month on MSDN

The binary file formats for the Microsoft Office 97 programs are available
in the MSDN Library, either on your subscription CDs or online at the MSDN
Web site. If you're not an online member (it's free), you'll need to
register before you can access the Library.

To access the binary file format information, select Microsoft Office
Development from the contents directory. Click to open Office, and then
click again to open Microsoft Office 97 Binary File Formats. Select the
appropriate category, such as "Microsoft Word 97 Binary File Format" or
"Microsoft Excel File Format."

Regards,
SimonJ
PC Pro Magazine



Mon, 23 Oct 2000 03:00:00 GMT  
 Finding text inside Word Docs without using Word

You can check here.

Patrick


Quote:
>Where is the documentation for the .DOC format, or does M$ want a pile of
>money for that information?

>Anyone know?

>Dan Moran



>>>Get yourself a hex viewer and look at a Word document.  You will find
that
>>>text is stored simply as text.  Therefore, opening the Word document in
>>>binary mode and using Instr works like a charm.

>>Well, yes and no. Yes, Word docs are usually saved in  binary format that
>>nonetheless allows text grepping, but often the structure (and text) is
>>screwy. So
>>you should have no trouble seeing if a given word appears *someplace* in
>the
>>doc, but large-scale parsing is a drag.

>>Plus, Word97 docs are often saved as RTF if the user wants it to be
>>readable in Word95. So,  you'll find lots of {\pard stuff.

>>I've been parsing Word docs using perl on a Unix box, running into all
>sorts
>>of fun.

>>James


>>>>Is there a way to find text inside Word documents, on a computer where
>>Word
>>>>is not available (only WordView), and where i can't use Find (advanced)
>>>from
>>>>W95.

>>>>Is it possible to create a prog witch can do this??

>>>>Any suggestion are welcome....

>>>>Stephan

>>>>--

>>>>To send E-mail, remove the spamblock ".rotzooi"
>>>>Om E-mail te sturen, verwijder de rotzooi.

begin 666 Wotsit's File Format Collection.url
M6TEN=&5R;F5T4VAO<G1C=71=#0I54DP]:'1T<#HO+W=W=RYW;W1S:70N9&5M

`
end
end


Wed, 25 Oct 2000 03:00:00 GMT  
 
 [ 6 post ] 

 Relevant Pages 

1. Finding text inside Word Docs without using Word

2. Finding text inside Word Docs without using Word

3. Finding text inside Word doc without Word?

4. Need OCX for viewing Word Docs without Word

5. Merge & print Word docs without Word

6. Find multiple words Using MS Word Find

7. Using Word.Application without installing Word

8. find/replace in a Word Doc using word object and vbscript

9. Convert Word docs to Word forms??

10. VB5 HARD ONE:Locking Word OLE session to other Word Docs

11. Pulling text from multiple word docs into a new doc

12. Inserting text from database to word docs

 

 
Powered by phpBB® Forum Software