X-Authentication-Warning: delorie.com: mail set sender to djgpp-bounces using -f
Message-Id: <201902030713.x137DSFN011258@delorie.com>
Date: Sat, 02 Feb 2019 21:17:24 +0100
From: "Juan Manuel Guerrero (juan DOT guerrero AT gmx DOT de) [via djgpp-announce AT delorie DOT com]"
To: djgpp-announce AT delorie DOT com
Subject: ANNOUNCE: DJGPP port of PCRE2 10.31 uploaded.
Content-Type: text/plain; charset=ISO-8859-15; format=flowed
Reply-To: djgpp AT delorie DOT com

This is a port of PCRE2 10.31 to MSDOS/DJGPP.

The PCRE2 library is a set of functions that implement regular expression
pattern matching using the same syntax and semantics as Perl 5. PCRE2 has
its own native API, as well as a set of wrapper functions that correspond
to the POSIX regular expression API.

PCRE2 is a re-working of the original PCRE library to provide an entirely
new API. It is written in C, and there are no C++ wrappers anymore.

The original, very widely deployed PCRE library is at version 8.42, and
the API and feature set are stable; future releases will be for bugfixes
only. All new features will go into PCRE2, not the original PCRE 8.x
series.


DJGPP specific changes.
=======================

To configure and compile this port you will need an OS with LFN support.
The resulting binaries themselves will also work on systems that have
only SFN support.

The usual files needed to configure the sources have been added. As
usual, they are stored together with the diffs file that documents my
changes in the /djgpp directory.

The port has been configured to support gzip and bzip2 compressed files.
The binaries will detect at runtime whether LFN or SFN support is
available. If LFN support is available, then only the default extensions
".bz2" and ".gz" will be used and no other ones will be honored.
But if only SFN support is available, then ".*bz" will also be accepted
as a valid bzip2 extension, and in the case of gzip compressed files
".*gz" and ".**z" will also be accepted as valid extensions, where "*"
always stands for any valid character. If a file with a valid bzip2 or
gzip extension cannot be opened using the appropriate compressor library
functions, it will be treated as an uncompressed file and will be opened
as a plain file. Please note that all files are opened in binary mode and
that neither the original code nor this port offers any way to change
this behaviour.

The library can be configured to accept different EOL characters. I have
chosen to configure this port using the option to enable any EOL
characters. This means that CR, LF and CRLF will be recognized as valid
EOL characters. If you do not like this, you will have to reconfigure and
recompile the port, passing the option you prefer to config.bat. The
following command line options are available:

  cr:          enables CR as EOL
  lf:          enables LF as EOL
  crlf:        enables CRLF as EOL
  any-crlf:    enables CR, LF and CRLF as EOL

The following command line options are also available to disable any of
the EOL characters:

  no-cr:       disables CR as EOL
  no-lf:       disables LF as EOL
  no-crlf:     disables CRLF as EOL
  no-any-crlf: disables CR, LF and CRLF as EOL

Enabling one of them disables all of the other ones. This concerns only
the library. The EOL character used by pcre2grep can always be controlled
with the -N command line option.

Please note that this choice has the consequence that if you have a
string looking like this:

  foo\r\nbar

the PCRE library and pcre2grep.exe will find 2 EOLs, one for \r and one
for \n. In other words, the two sequences "CRCRLF" and "LFCRLF" will
always produce two EOL matches. The "CRCRLF" sequence will produce one
match for the first CR and a second one for the following CRLF. The
"LFCRLF" sequence will produce a match for the first LF and a second
match for the following CRLF.
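
The two sequences spelled out above can be traced with a small sketch.
This is not PCRE2 code and not part of the port. It is a hypothetical
Python helper (count_eols is a made-up name) written under the assumption
that, at each position, a CRLF pair is tried before a lone CR or LF,
which is what the "CRCRLF" and "LFCRLF" descriptions imply.

```python
def count_eols(data: bytes) -> int:
    """Count EOLs, assuming CRLF is preferred over a lone CR or LF.

    Hypothetical illustration only; the real library does this in C.
    """
    count = 0
    i = 0
    while i < len(data):
        if data[i:i + 2] == b"\r\n":
            # A CR immediately followed by LF counts as a single EOL.
            count += 1
            i += 2
        elif data[i:i + 1] in (b"\r", b"\n"):
            # A lone CR or a lone LF counts as one EOL each.
            count += 1
            i += 1
        else:
            i += 1
    return count


# The two sequences described in the text:
crcrlf = count_eols(b"\r\r\n")   # first CR, then the CRLF pair
lfcrlf = count_eols(b"\n\r\n")   # first LF, then the CRLF pair
```

Under that assumption both calls yield two EOLs, matching the
description of the two sequences.
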
Please note that this behaviour differs from DJGPP's port of grep. It was
not my intention to modify the PCRE code in such a drastic way as to
emulate DJGPP's grep behaviour.

pcre2grep.exe also offers color support without having to install an
ansi.sys driver.

Please also note that I have configured the port to support neither UTF-8
Unicode character strings nor any UTF-8 EOL character sequence.

There are more options like this:

  pcre8 or no-pcre8, default pcre8.
    Enables 8 bit character set support and disables 16 bit character
    set support.
  pcre16 or no-pcre16, default no-pcre16.
    Disables 8 bit and 32 bit character set support and enables 16 bit
    character set support. DJGPP does not support 16 bit character sets
    AFAIK.
  pcre32 or no-pcre32, default no-pcre32.
    Disables 8 bit and 16 bit character set support and enables 32 bit
    character set support. DJGPP does not support 32 bit character sets
    AFAIK.
  jit or no-jit, default no-jit.
    Disables Just-In-Time compiling support.
  grepjit or no-grepjit, default no-grepjit.
    Disables Just-In-Time support in pcre2grep.

Neither the Just-In-Time compiling support for the library nor the JIT
support for pcre2grep is supported by me. I have checked the code to see
if there are DOS specific issues to fix, but I did not find any. To
compile it you will need to install some pthread library together with a
socket library. I have never used that kind of library on DOS with DJGPP,
so I am not able to support JIT. If you want to try, you are on your own.

The pcre2test.exe binary will not support the -S command line option that
allows the program stack to be changed. This is because DJGPP's setrlimit
does not support this feature.

The port has been configured to use the readline and history libraries.
If you do not like this, you will have to reconfigure the sources,
passing the "no-rl" command line option to config.bat. The default is
always that readline is used.

The port passes the test suite except for the last test.
This test is completely UNIX or LF centric and does not work well with
any EOL encoding other than the one used on POSIX systems. This is known
to the author and maintainer of the PCRE library, and I have no plans to
write a CRLF specific test case for the DJGPP port.

Certain man and html pages have been renamed to fit into the SFN limits.
The index.html has been adjusted accordingly.

As mentioned before, to configure and compile the package you will have
to install the following packages too:

  ftp://ftp.delorie.com/pub/djgpp/current/v2gnu/rdln70b.zip
  ftp://ftp.delorie.com/pub/djgpp/current/v2tk/zlb1211b.zip
  ftp://ftp.delorie.com/pub/djgpp/current/v2tk/bz2-106ar2.zip

Of course, you can always use newer versions of these ports if they are
available.

The source package is distributed preconfigured to be built in the
/_build directory located under the top srcdir.

The port has been configured and compiled on WinXP SP3 and Win98SE using
gcc346b and bnu2311b. There is no guarantee that this will be possible
with any other DOS-like OS. Due to the massive use of long file names it
will not be possible to configure and compile without LFN support.

Please read the docs. There are no info formatted docs. All the extensive
documentation is html formatted and placed in /share/doc/pcre/html

All the changes done to the original distribution are documented in the
diffs file, which is located together with all the files needed to
configure the package (config.bat, config.sed, config.site, etc.) in the
/djgpp directory.

For further information about PCRE2 please read the man pages and the
NEWS file.

As the pcre author noted, the original, very widely deployed PCRE1
library is at version 8.42, and the API and feature set are stable;
future releases will be for bugfixes only. All new features will go into
PCRE2, which is at version 10.31, not into the original PCRE 8.x series.
The port of the last version of the PCRE1 library (aka pcre842[b|s].zip)
is also available in the /current directory, just in case it is preferred
over the new API.

This is a verbatim extract of the NEWS file:
-------------------------------------------------------------------------------
Version 10.31 12-February-2018
------------------------------

This is mainly a bugfix and tidying release (see ChangeLog for full details).
However, there are some minor enhancements.

1. New pcre2_config() options: PCRE2_CONFIG_NEVER_BACKSLASH_C and
PCRE2_CONFIG_COMPILED_WIDTHS.

2. New pcre2_pattern_info() option PCRE2_INFO_EXTRAOPTIONS to retrieve the
extra compile time options.

3. There are now public names for all the pcre2_compile() error numbers.

4. Added PCRE2_CALLOUT_STARTMATCH and PCRE2_CALLOUT_BACKTRACK bits to a new
field callout_flags in callout blocks.
-------------------------------------------------------------------------------

The port has been compiled using stock djdev205 and consists of the
following two packages that can be downloaded from ftp.delorie.com and
mirrors as (time stamp 2018-09-08):

  PCRE2 10.31 binaries, headers, libs and man formatted documentation:
  ftp://ftp.delorie.com/pub/djgpp/current/v2tk/pcr1031b.zip

  PCRE2 10.31 source:
  ftp://ftp.delorie.com/pub/djgpp/current/v2tk/pcr1031s.zip

Send PCRE2 specific bug reports to .
Send suggestions and bug reports concerning the DJGPP port to
comp.os.msdos.djgpp or .

Enjoy.

   Guerrero, Juan Manuel