New word suggestion - PARSE-NAMES

gforth

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

New word suggestion - PARSE-NAMES

From:	James Norris
Subject:	New word suggestion - PARSE-NAMES
Date:	Mon, 10 Aug 2020 11:00:20 -0400
User-agent:	Roundcube Webmail/1.4.7

If you add PARSE-NAMES to Forth, then the user can easily add additionalstate behavior to Forth.

The way PARSE-NAMES (or PARSE-WORDS) works is, it is a combination ofPARSE-NAME and PARSE with a few modifications.

PARSE-NAMES differs from PARSE-NAME in that it treat uenddelimiter as adelimiter for a name in addition to the <SPACE>character. The rules for parsing uenddelimiter are the same as for<SPACE>. This means no white space delimiters areneeded before or after uenddelimiter. Also, after parsinguenddelimiter, the >IN offset will be on the character after

 uneddelimiter.

PARSE-NAMES differs from PARSE in that it is multi-line.
Multi-line means that:

PARSE-NAMES treats line terminator characters as white spacedelimiters.If the current input buffer is a file, ufoundendflag is only true ifthe end of file was reached or uenddelimiter was foundIf the current input buffer was from EVALUATE, ufoundendflag is onlytrue if the end of the length passed to EVALUATE

   was reached or uenddelimiter was found

If the current input buffer was from the user from a terminal inputdevice, then ufoundendflag is only true if the end ofwhatever packet was passed in from the user was reached oruenddelimiter was found.

   This packet may include line terminator characters.

If the current input buffer was from a block, then I'm not supportingblocks or am familiar with their use. I suggest askingpeople who use blocks what they want. But as an initialrecommendation I suggest making ufoundendflag only true when the end

   of the block is reached or uenddelimiter was found.
  Refill is done if your implementation needs to do it.

The reason for adding PARSE-NAMES (or PARSE-WORDS) to Forth is that youcan make words like this:


: VARIABLES{
   BEGIN
    [CHAR] } PARSE-NAMES
    DUP 0= IF
     DROP
    ELSE
     NEXTNAME CREATE
     ALIGN 1 CELLS ALLOT
    THEN
   UNTIL ;

Which you can use like this:

VARIABLES{ x y z }

You could even make a word like CONSTANTS{ which would be the same asabove exceptit tries to convert the name to a number first. If it's a number itpushes it to the data stack.If it's not a number, it uses the name as the name of a new constant.Which you can use like this:


CONSTANTS{ 3 x 4 y 5 z }

You could also make a word that initializes variables. If the name is anumber it pushes it to the data stack.If it's not a number, it creates a new variable with the name and putsthe top number on the data stack into it.

So something like this:

INITIALIZED-VARIABLES{ 3 x 4 y 5 z }

You could also make a word that compiles bytes. Something that covertsthe names to numbers andpushes the low byte of the converted number onto the end of the currentcompile buffer. Something like:


HEX
COMPILE-U8S{ 37 82 FF 63 97 C4 }

Words like COMPILE-U8s might make initializing compile time data easier,and more readable.


PARSE-NAMES can also be used to implement LOCALS|


// Stack action shorthand:
//  ( &quot;&lt;delimiters&gt;word&lt;delimiters&gt;morestuff&quot; |
//     &quot;&lt;delimiters&gt;word&lt;enddelimiter&gt;morestuff&quot;
//     -currentinputbuffer- &quot;&lt;delimiters&gt;morestuff&quot; )
//  ( uenddelimiter -- ufoundendflag c-addr ulength )
//
// Data stack in:

// uenddelimiter a character (byte) that will end theparsing// in addition to the whitespacedelimiter list

//
// Data stack out:

// ufoundendflag FORTH_TRUE if the parse ended onuenddelimiter or// the parse reached the end of thecurrent input

//                                 buffer (file)

// c-addr start address of word in current inputbuffer// ulength length of word in characters (bytes)in the

//                                 current input buffer (file)
//
// Action:

// Moves the current offset pointer (>IN) in the current input bufferto the character after// any leading delimiters or to the character after uenddelimiter orto the end of the buffer if

//   either of those come first to find the start of the next word.

// If the end of the current input buffer or uenddelimiter was foundthen

//   ufoundendflag = FORTH_TRUE and ulength = 0 is returned.

// Else this moves the current offset pointer in the current inputbuffer to after the// next occurrence of a delimiter or to the end of the buffer if thatcomes

//   first, to find the end of the word.
//  Then pushes TRUE to the data stack if an occurrence of uenddelimiter
//   or the end of the current input buffer was reached. Otherwise FALSE
//   is pushed to the data stack.

// Then pushes a pointer to the address of the current offset at thestart of the// word and the length of the word in characters (bytes) onto the datastack.


// Note:

// I suggest these as white space delimiters (this list should workwith most editors):

//   c shorthand      ascii code    name
//   ' '              0x20          &lt;space&gt;
//   '\n'             0x0a          &lt;line feed&gt;
//   '\t'             0x09          &lt;tab&gt;
//   '\v'             0x0b          &lt;vertical tab&gt;
//   '\b'             0x08          &lt;back space&gt;
//   '\r'             0x0c          &lt;carriage return&gt;
//   '\f'             0x0f          &lt;form feed&gt;
//

Jim Norris author of DiaperGlu
http://www.rainbarrel.com

[Prev in Thread]

Current Thread

[Next in Thread]

New word suggestion - PARSE-NAMES, James Norris <=

Prev by Date: Re: GNU/GForth.org-hosted F-Droid repository for freedom-reviewed Android applications written in GForth?
Next by Date: gforth AMD64 assembler problem
Previous by thread: GForth in F-Droid repository?
Next by thread: gforth AMD64 assembler problem
Index(es):
- Date
- Thread