Questions tagged [doc]

Questions about the old Microsoft Word file format and how to use it.

The "doc" format is a proprietary file format used by Microsoft as their main file format for text documents from the 1980s till 2007. With the release of Word 2007 "docx" is the main file format.

All Questions about the older (pre .docx) Wordfiles should use this tag. Questions about .docx files can use this tag, but should use "docx" primarily. Questions about .odf .txt .rdf should not use this tag.

Wikipedia about "doc" http://en.wikipedia.org/wiki/DOC_%28computing%29

Decrypted file format: http://sc.openoffice.org/compdocfileformat.pdf

795 questions
112
votes
16 answers

Is there a Java API that can create rich Word documents?

I have a new app I'll be working on where I have to generate a Word document that contains tables, graphs, a table of contents and text. What's a good API to use for this? How sure are you that it supports graphs, ToCs, and tables? What are some…
billjamesdev
  • 14,133
  • 6
  • 48
  • 72
78
votes
11 answers

How can doc/docx files be converted to markdown or structured text?

Is there a program or workflow to convert .doc or .docx files to Markdown or similar text? PS: Ideally, I would welcome the option that a specific font (e.g. consolas) in the MS Word document will be rendered to text-code: ```....```.
Lorenz Lo Sauer
  • 20,692
  • 12
  • 75
  • 85
53
votes
7 answers

How to extract just plain text from .doc & .docx files?

Anyone know of anything they can recommend in order to extract just the plain text from a .doc or .docx? I've found this - wondered if there were any other suggestions?
docextract
  • 603
  • 1
  • 6
  • 3
49
votes
4 answers

Should I use @return self, this or the current class?

I have a method that return the current object, how do I document this? /** * set something * * @return this */ public function setSomething(){ // ... return $this; } Or should I do @return self or @return…
lucaswxp
  • 1,464
  • 4
  • 15
  • 27
39
votes
3 answers

How to use clojure doc function?

I'm just starting with Clojure and can't access to the doc function. I'm using clojure 1.3 with emacs24 and swank-clojure. user> *clojure-version* {:major 1, :minor 3, :incremental 0, :qualifier nil} But when I try: (doc doc) I get: Unable to…
zaforas
  • 393
  • 3
  • 6
30
votes
6 answers

looking up c++ documentation inside of vim

I code c++, using vim. Often times, I find myself wasting time (and brekaing flow) looking up trivial things like: is std::string.substring does it take (start, length) or (start, end). This often results in open browser; waid; seach on google;…
anon
  • 36,629
  • 47
  • 184
  • 286
28
votes
9 answers

How do I see the Python doc on Linux?

In Windows, Python has a chm type document, and it is very convenient to read. But in the Linux, is there any document let me to read?
Tanky Woo
  • 4,143
  • 7
  • 38
  • 71
23
votes
7 answers

Upload DOC or PDF using PHP

I'm able to upload images fine, but when when I change the types from image/jpg, image/gif to application/msword and application/pdf, it doesn't work. Here's my code. The exact same code works for images, but for uploading docs and pdf, it outputs…
user961627
  • 11,059
  • 35
  • 119
  • 202
21
votes
4 answers

Using Vim to edit Microsoft Word files

I've found ViEmu, a vi emulator for microsoft word. However, I wanted to use vim to edit DOC or even rtf files. Is this possible ? Are they any other formats that preserve page/paragraph layout compatible with both Microsoft Word and Vim? I am also…
Kilon
  • 1,762
  • 2
  • 15
  • 22
21
votes
4 answers

PHP Convert Word file to HTML without losing styling and images

Is there an API for converting word files to HTML without losing the format? Can the google documents API be used for this? I tried saaspose but the returning result is always a server error. Solutions that did not work for me: Converting MS Word…
Herr
  • 2,677
  • 3
  • 28
  • 36
20
votes
1 answer

How read Doc or Docx file in java?

I want to read a word file in java import org.apache.poi.poifs.filesystem.*; import org.apache.poi.hpsf.DocumentSummaryInformation; import org.apache.poi.hwpf.*; import org.apache.poi.hwpf.extractor.*; import…
Ninja
  • 219
  • 1
  • 3
  • 7
20
votes
3 answers

Convert doc to txt via commandline

We're searching a programm that allows us to convert a doc or docx document to a txt file. We're working with linux and we want to start a website that converts user uploaded doc files. We don't wanna use open office/libre office cause we have bad…
user698601
  • 480
  • 1
  • 4
  • 9
20
votes
4 answers

Convert PDF to DOC (Python/Bash)

I've saw some pages that allow user to upload PDF and returns a DOC file, like PdfToWord Is there any way to convert a PDF file to a DOC/DOCX file using Python or any Unix command ? Thanks in advance
AlvaroAV
  • 9,087
  • 10
  • 52
  • 83
20
votes
4 answers

Parse Microsoft Office files in Node.JS

I'm working on a web application where users can upload Microsoft Office Document files. Right now, our server is running Node.JS with Express.js and we're hosted on Heroku. Because of this, I don't think that I can install programs such as abiword…
arknave
  • 583
  • 1
  • 5
  • 16
19
votes
1 answer

can git be used for version control on non text documents such as word doc or xls etc

I've been learning about the GIT version control system recently and It seems to work very well for plain text documents as you can add a single line, go back and fork it, revise the document and remove the line you just added. I program mostly in…
yoshiserry
  • 14,991
  • 24
  • 67
  • 96
1
2 3
52 53