delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2010/02/05/06:59:55

X-Recipient: archive-cygwin AT delorie DOT com
X-SWARE-Spam-Status: No, hits=-0.4 required=5.0 tests=AWL,BAYES_50
X-Spam-Check-By: sourceware.org
Date: Fri, 5 Feb 2010 13:59:41 +0200
Message-Id: <201002051159.o15BxfJL009697@beta.mvs.co.il>
From: "Ehud Karni" <ehud AT unix DOT mvs DOT co DOT il>
TO: maarten DOT w DOT b DOT vanneste AT gmail DOT com
Cc: cygwin AT cygwin DOT com
Subject: Re: xls to text conversion ?
In-reply-to: <a74950d41002041055y32b83ae9va3e752cd4007dff@mail.gmail.com> (message from Maarten Vanneste on Thu, 4 Feb 2010 19:55:36 +0100)
Reply-to: ehud AT unix DOT mvs DOT co DOT il
References: <a74950d41002041055y32b83ae9va3e752cd4007dff AT mail DOT gmail DOT com>
MIME-Version: 1.0
X-IsSubscribed: yes
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Id: <cygwin.cygwin.com>
List-Unsubscribe: <mailto:cygwin-unsubscribe-archive-cygwin=delorie DOT com AT cygwin DOT com>
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com

On Thu, 4 Feb 2010 19:55:36 Maarten Vanneste wrote:
>
> Dear all,
> I was wondering whether there is a cygwin tool/program which converts
> microsoft excel files to text. Any suggestions ?

I use xlhtml, source at:  http://chicago.sourceforge.net/xlhtml/

I applied the following patch:

diff -c ~/xlhtml-0.5/xlhtml/xlhtml.h-org ~/xlhtml-0.5/xlhtml/xlhtml.h
*** ~/xlhtml-0.5/xlhtml/xlhtml.h-org	Thu Apr 11 03:23:51 2002
--- ~/xlhtml-0.5/xlhtml/xlhtml.h	Thu Dec 24 18:16:51 2009
***************
*** 20,28 ****
  #include "version.h"

  /* Used by packed string array Opcode: 0xFC */
! #define HARD_MAX_ROWS_97	0x7FFE    /*!< Used in add_wb_array to prevent OOM */
! #define HARD_MAX_ROWS_95	0x3FFF    /*!< Used in add_wb_array to prevent OOM */
! #define HARD_MAX_COLS		256	      /*!< Used in add_wb_array to prevent OOM */

  static U16 HARD_MAX_ROWS = HARD_MAX_ROWS_97;
  /**********************************
--- 20,28 ----
  #include "version.h"

  /* Used by packed string array Opcode: 0xFC */
! #define HARD_MAX_ROWS_97	0x0FFFF   /*!< Used in add_wb_array to prevent OOM */
! #define HARD_MAX_ROWS_95	0x07FFF   /*!< Used in add_wb_array to prevent OOM */
! #define HARD_MAX_COLS		256	  /*!< Used in add_wb_array to prevent OOM */

  static U16 HARD_MAX_ROWS = HARD_MAX_ROWS_97;
  /**********************************
***************
*** 57,63 ****
  	U16 type;	/*!< This will record the record type that generated the cell */
  	U16 spanned;		/*!< If 1 don't output */
  	uni_string ustr;	/*!< The cell's displayed contents */
! 	U16 rowspan;		/*!< rows to span */
  	U16 colspan;		/*!< columns to span */
  	uni_string h_link;	/*!< If a hyperlinked cell, this is the link*/
  }cell;
--- 57,63 ----
  	U16 type;	/*!< This will record the record type that generated the cell */
  	U16 spanned;		/*!< If 1 don't output */
  	uni_string ustr;	/*!< The cell's displayed contents */
! 	U32 rowspan;		/*!< rows to span */
  	U16 colspan;		/*!< columns to span */
  	uni_string h_link;	/*!< If a hyperlinked cell, this is the link*/
  }cell;


I did it on GNU/Linux but I think it'll work well on Cygwin.

It worked for several years in our company without the patch, but
lately we started receiving Excel files with 65535 lines in one sheet,
so I had to change the hard coded line limit.

Ehud.


--
 Ehud Karni           Tel: +972-3-7966-561  /"\
 Mivtach - Simon      Fax: +972-3-7976-561  \ /  ASCII Ribbon Campaign
 Insurance agencies   (USA) voice mail and   X   Against   HTML   Mail
 http://www.mvs.co.il  FAX:  1-815-5509341  / \
 GnuPG: 98EA398D <http://www.keyserver.net/>    Better Safe Than Sorry

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019