2022-12-28 00:02:06 Ok. I think I've sketched out all of the "subtle" parts of this thing now. There are a few odds and ends still undrawn, like the address registers, a config register I know I'm going to need, etc. But those are all "very simple and straightforward" things. I think I've now got all of the delicate pieces in mind. 2022-12-28 00:02:27 Now I just need to go through it all again, carefully, and make final drawings. 2022-12-28 00:02:37 With no corners cut. 2022-12-28 01:23:52 Well, I think the simplest way to handle external RAM is to just put in my own nop cycles. Instead of >a a@ or >b b! I'd have >a nop a@ or >b nop b! *as required*. The nice thing about that is that I could always try to arrange to have some actual instruction to run in between. 2022-12-28 01:24:08 And if I ever wanted to it would be an easy thing to write the compiler to do for me. 2022-12-28 01:25:09 Well, sometimes. 2022-12-28 01:25:40 To the extent I could tell at compile time which type of RAM the operation was destined for. 2022-12-28 01:25:55 Who knows? It may wind up being that that's rare. 2022-12-28 01:26:19 Anyway, the typical SRAM speeds I've seen on digikey the lsat day or two are in the 8ns-10ns range. 2022-12-28 01:27:06 And for streaming the data out to an LCD I don't need to actually pass the data through the stack - the circuiry that does that will just advance the address at the appropriate rate and insert the retrace delays where suitable. 2022-12-28 01:28:11 One thing I could possibly do is write the code so that when I do modifications on the display RAM it buffers the operations somewhere and then writes a batch of them out after getting a "retrace interrupt." 2022-12-28 01:30:56 That circuitry will use it's own internally generated address to write to the RAM "normally," but when it posts that retrace interrupt it would flip a bit such that the processor address registers produced that address. So I could do normal writes. It would flip it back automatically when it was time to write the next frame / line. 2022-12-28 01:31:09 And it would just be up to me to make sure I didn't try to write new stuff out for two long. 2022-12-28 01:31:24 The writes would just stop working if I tried to do too much. 2022-12-28 01:33:31 Oh, actually there would be no need for the nop in the >b nop b! sequence unless I was about to do another RAM operation. The processor doesn't need that delay to clock the right addr/data out. 2022-12-28 02:33:17 You know, here's a blue-sky idea. I have these two extra bits in each 32-bit control word that define the usage of that cell. 2022-12-28 02:33:55 The cases I've used are a) five packed opcodes, b) Forth call to an address, c) Forth jump to an address (tail optimization). 2022-12-28 02:34:17 I have the opcode level "me" mechanism and also the cell-level rep mechanism for jumps within a word. 2022-12-28 02:34:24 Ok, but there's a fourth case available. 2022-12-28 02:34:42 One possibility would be d) word contains two 15 bit call addresses. 2022-12-28 02:35:17 Wouldn't be usable in all situations, but my gut feel is that it would be *often* usable. The addresses could be either absolute or relative. 2022-12-28 02:35:28 That would result in a DRAMATIC reduction in code size. 2022-12-28 02:35:50 I have a feeling you'd like transport-triggered architectures, KipIngram. 2022-12-28 02:36:13 I don't konw what that is, but I will look it up. 2022-12-28 02:36:43 your code directly drives the connection between processor elements instead of being decoded to those signals. 2022-12-28 02:37:30 Oh, doesn't that make the instruction words a lot bigger? 2022-12-28 02:37:50 Almost all of my control signals in this design can be produced with a single LUT. 2022-12-28 02:37:56 it can or it can't. the simplest instruction is "connect X to Y". 2022-12-28 02:38:01 since 6-bit opcodes and 6-input LUTs. 2022-12-28 02:38:18 Sounds a little like dataflow. 2022-12-28 02:40:33 Anyway, this would add some extra complication in ir_control. I'm only teasing with the thought because it would, in fact, shrink the code so much. 2022-12-28 02:44:38 Oh, looking at it on Wikipedia. Yeah, that is a "view" of things I like. It is, after all, what's REALLY happening. 2022-12-28 02:44:57 You transport stuff from where it's stored to the ALU inupts, you transport the ALU output to the destination. 2022-12-28 02:45:00 And so on. 2022-12-28 02:45:09 That's exactly the level I think at when I do this kind of work. 2022-12-28 02:45:32 Then you go through and make sure that your clock period is long enough for all the right stuff to happen. 2022-12-28 02:45:57 Another way of saying is that you "transport register outputs to combinational circuit inputs." 2022-12-28 02:46:08 And load registers from combinational circuit outputs. 2022-12-28 02:46:13 It's the whole game. 2022-12-28 02:47:14 That's probably the clearest view there is of what a computer "does." 2022-12-28 02:48:06 It's the right level to think about if you have to design the hardware. 2022-12-28 02:49:23 It's like supplying the differential equation of motion in a physics problem, because that differential equation is what turns 'state now' into 'state a short time later.' 2022-12-28 02:49:40 The combinational logic is the computer's "equation of motion." 2022-12-28 02:52:03 It may turn out someday that the whole universe works that way, using the Planck time as the "clock period." 2022-12-28 02:52:13 That's pretty speculative, though. 2022-12-28 03:01:28 Hey, pretty astute call, decay. That really connects with this recent work gracefully. 2022-12-28 03:03:10 In my case, each step is a "set" of transports. I might move things around in several places, relying on the fact that they're happening on the same clock edge. 2022-12-28 03:03:49 I may be sending "now TOS" somewhere and sending something else into "next TOS." If they weren't all guaranteed to happen in parallel, the the result would depend on the order things were done in. 2022-12-28 03:04:36 That's why I've commented from time to time that I think "basic training" for digital EE guys promotes a smoother adoption of threaded programming than the early phases of software training does. 2022-12-28 03:04:55 We just never get that "sequential mind set" baked in to the same extent. 2022-12-28 03:09:05 Well, actually we do a little, but instead of it being "this and then that," it's "this and that AT THE SAME TIME." 2022-12-28 03:09:28 Then we get a little further and learn that that "at the same time" qualifier if iffy if we're not careful. 2022-12-28 03:09:39 And that's where we learn about setup and hold times. 2022-12-28 03:13:48 ;) 2022-12-28 03:22:41 you could use this to your advantage to have a smaller processor design. 2022-12-28 03:22:50 smaller and more modular. 2022-12-28 03:24:48 one of my earlier designs had an input buffer full of instruction words ("connect A to B"/"send A to B") that each of the modules could write to. so your program counter (and program _memory_) needed to be a separate module that you could address. 2022-12-28 03:25:39 the _advantage of that_... was that you suddenly had concurrent execution for free. 2022-12-28 03:25:58 and your programs could include a simple scheduler that could control everything at the gate level. 2022-12-28 05:48:25 Literally seconds before a junction the overhead signs changed saying "turn off at next exit", then signs ahead changed saying "all lanes closed". So I got off 2022-12-28 05:48:46 A lorry overturned, blocking 5 lanes on the motorway, would have been stuck there for like 5 hours 2022-12-28 05:49:13 Well actually looks like they've let people turn around and drive wrong way, but hours nevertheless 2022-12-28 10:53:00 Thank god for occasional common sense. 2022-12-28 10:53:12 Re: the letting folks drive the wrong way thing. 2022-12-28 10:54:31 Glad you dodged that. 2022-12-28 10:54:35 obviously the British side is correct; otherwise, the lance would be held wrong 2022-12-28 12:20:11 Well, I just bit the bullet an ordered one of those Swiss Micro calculators. The DM41X, since it's basically a "clone" of my college calculator. 2022-12-28 12:20:36 Wasn't willing to spring for the express shipping, so I'll have to wait a while for it. Impatient already. :-) 2022-12-28 12:21:02 Could be as long as a month, but they say "usually faster." 2022-12-28 12:23:12 I have altogether too many calculators now. 2022-12-28 12:30:02 I also have two of those "after market mod" ones that I mentioned the other day; I bought one and later couldn't find it so bought another, then found the first one. WP-34S. 2022-12-28 12:30:32 And I've got an "anniversary edition" they made of the HP-35S. I wanted that one because it was manufactured "old style" with the good keys. 2022-12-28 12:30:53 Then I've got to TI nspire cx units that the kids used at some point in high school and I've somehow inherited. 2022-12-28 12:31:04 And I've also got a TI-84 variant. 2022-12-28 12:31:28 So that's six calculators; the DM-41X will make it seven. Getting to be quite a collection. :-) 2022-12-28 12:31:50 The TI-84 also came from a daughter at some point. 2022-12-28 12:32:15 They're all in a drawer of my nightstand. 2022-12-28 13:21:20 I've got quite a few TI calculators, including some -83s, an -86, an -84, and a -92 2022-12-28 13:24:50 I was very interesting at one point in learning how to program them, and running alternative OSs on them, but ran into a lot of trouble trying to get the TILP stuff working on my Guix system 2022-12-28 13:25:03 I was able to get it to work for my -89 2022-12-28 13:25:19 and then one my children dropped the -89 in a toilet 2022-12-28 13:28:24 he was quite young and didn't mean any harm, but wasn't able to get the -89 working again 2022-12-28 13:30:01 :-) 2022-12-28 13:30:10 Children: elementals of destruction. 2022-12-28 13:30:20 I think they totally explain the second law of thermodynamics. 2022-12-28 13:46:43 My sons got some Transformers for Christmas, included a two-headed dragon one. Shall we name it Swap-Swap? 2022-12-28 13:46:52 ACTION uploaded an image: (92KiB) < https://libera.ems.host/_matrix/media/v3/download/trad.chat/NdGePgLSCmStButHUUitwffg/IMG_20221228_124544.jpg > 2022-12-28 14:06:42 Ah, groovy. 2022-12-28 14:07:05 I thought the first couple of Transformers movies were pretty good. 2022-12-28 14:07:21 And Megan Fox was especially good. :-) 2022-12-28 14:07:48 There's a character in my favorite book series that I think she at that age would have been ideal for. 2022-12-28 14:08:29 Maeve, the "Winter Lady" (generally think Winter princess), and she's all-round dangerous to have around. 2022-12-28 14:09:01 Thoroughly irresistable and uses it constantly. 2022-12-28 14:32:43 Winter princess as opposed to turkish delight queen 2022-12-28 14:35:48 :-) I don't immediately think of a character that matches that. 2022-12-28 14:36:17 In those books there's the Fairy Courts, Winter and Summer, and there are several flavors of vampire Courts. 2022-12-28 14:36:35 Three, actually - Black, Red, and White. 2022-12-28 14:37:03 Black is the most like a Stoker-style vampire. Reds aren't too terribly different; still feed on blood, but they can make themselves look better. 2022-12-28 14:37:16 White Court vamps are basically succubi. 2022-12-28 14:37:38 The most prominent ones, at least - some feed on other emotions than lust. 2022-12-28 14:38:41 https://www.savagechickens.com/2009/10/yappy-dog.html 2022-12-28 14:39:55 So aside from my data stack, none of the other stacks are going to have register cached top items. The others will just have a top element addressed, and I'll work directly with the block ram as though it's a register. 2022-12-28 14:40:43 Don't need the registers, because those stacks never do the fancy motions like swap and over and nip and so on. 2022-12-28 14:45:03 KipIngram: white witch from narnia 2022-12-28 14:50:34 Well, the books have wizards. Main character is a wizard. 2022-12-28 14:51:09 I mean the "turkish delight queen" is the white witch from narnia 2022-12-28 14:51:15 Ah. 2022-12-28 14:51:46 I just started a new series with main character Alex Verus. London-based "mage." Sees the future. 2022-12-28 14:51:58 Or futures, rather. 2022-12-28 14:52:13 It's ok so far, but I don't really have an "opinion" yet. 2022-12-28 14:52:43 Weirdly I think to 'get' fantasy you need to start with the bible, and then stuff like beowulf and fairy tales, then lord of the rings 2022-12-28 14:53:17 They're pretty much essential to understanding fantasy 2022-12-28 14:56:47 or Gilgamesh, which totally rips off the whole Noah's Ark thing 2022-12-28 14:57:32 Oh, I agree. If you just approach it as all made up then it's not terribly interesting. 2022-12-28 14:57:47 But if you can look for how the author has exploited real lore, now that can get fun. 2022-12-28 14:58:03 Most of the really ancient cultures had flood myths. 2022-12-28 14:58:13 Who ripped off whom becomes... hard to pin down. 2022-12-28 14:58:32 Gilgamesh is Sumerian and actually pre-dates the Old Testament. 2022-12-28 14:59:18 It all got to be a big melting pot of shared lore. 2022-12-28 14:59:48 We just happen to live in a culture that puts special significance on a particular thread of mythology. 2022-12-28 15:00:25 The books I like and mentioned above - the author does a REALLY good job of weaving all that old stuff into things. 2022-12-28 15:02:05 One fun twist is that it comes up (as a minor sidebar - not a big story element) that the White Court of vampires precipitated Bram Stoker writing his books - they did it to get a "how to kill Black Court vampires" how-to-guid into the hands of the public. 2022-12-28 15:02:32 As a result, the Blacks, once the dominant vampire lineage, got all but exteriminated, with only the most capable and strongest still floating around. 2022-12-28 15:02:45 Pretty much ended them as a "political faction." 2022-12-28 15:17:35 crc: So how is ilo encoded then? Are the 5-bit opcodes packed into cells? Or one operation per cell? 2022-12-28 15:18:06 Four instructions per cell 2022-12-28 15:19:33 And how does IP work then? Are all four executed in a batch? 2022-12-28 15:19:46 Or is IP byte addressed 2022-12-28 15:20:59 https://pastebin.com/raw/dMXKC1fV 2022-12-28 15:21:53 A cell is fetched; each instruction is processed sequentially, but all addressing is cell based 2022-12-28 15:22:21 Thanks 2022-12-28 15:22:40 no problem 2022-12-28 15:22:49 The C code explains it well 2022-12-28 15:22:54 Good C code goes a long way 2022-12-28 15:23:53 fossils.retroforth.org:8000/ilo/file?udc=1&ln=on&ci=trunk&name=ilo.c is the C code for the VM 2022-12-28 15:24:37 That's awesome 2022-12-28 15:24:51 See, to me that's actually easier to read than the docs you posted the other day lol 2022-12-28 15:25:02 I also have nim, lua (requires a build w/32-bit integers), python3, and a few assembly (x86, amd64, aarch64) versions 2022-12-28 15:25:36 Everybody reading please take a moment to skim that VM's source code, it won't take you long 2022-12-28 15:26:43 So basically it runs everywhere 2022-12-28 15:27:25 How large is your retro image in bytes? 2022-12-28 15:28:03 Yes. I've built it for DOS, Mac System 5/6/7, and a wide variety of newer systems 2022-12-28 15:33:00 crc: I'm sort of interested in concept of porting this to ZX Spectrum now, although it would be limited to something like 8K cells, think it would fit? 2022-12-28 15:35:02 8k cells would be tight. The base system is 3818 cells; but I reserve 5,536 cells (not all used) for various system things (tib, sigils, temporary strings & arrays, etc) 2022-12-28 15:36:23 I'll do some tests to see how small I can go with it 2022-12-28 15:37:49 The absolute maximum amount of RAM you have is 10.5K cells, but some of them are slower to access 2022-12-28 15:39:37 crc: Does the basic image assume some kind of ANSI terminal for I/O? 2022-12-28 15:41:47 Also is there some assumption of hard storage 2022-12-28 15:43:42 Ah I see in the VM, silly question 2022-12-28 15:44:13 No assumptions of an ANSI terminal 2022-12-28 15:44:15 Technically I could have load-image/save-image work with tape, but I can ignore that at first 2022-12-28 15:46:42 The VM defines a generic block storage device. The image saving isn't strictly necessary, it's just nice to have 2022-12-28 15:47:21 Yeah 2022-12-28 15:48:28 Do you use a tiny font? 2022-12-28 15:51:21 How does it clear the screen then? It has to assume some kind of escape sequences right? 2022-12-28 15:54:04 In the default system, I don't clear the screen. I just use it as if I were on a simple line terminal. 2022-12-28 15:56:16 For fonts; on my phone, yes; a small font (I use an external keyboard, with font sized to give me about 21 lines and at least 72 columns). On desktop, I go with 80x25, and as large as I can. The optional graphical builds use an 8x16 font presently, on a 720x720 display (with pixel doubling on higher resolution displays if they can fit it). 2022-12-28 15:57:05 There's an optional vocabulary that makes use of ANSI/VT100 escape sequences; it's not required, and I use it sparingly. 2022-12-28 16:00:03 manual clears the screen 2022-12-28 16:00:16 That's all I mean 2022-12-28 16:02:27 What control characters are required then, \n \r \t and \b ? 2022-12-28 16:04:35 spectrum just has a display buffer, no 'terminal', so I would need to implement that logic and keyboard scanning 2022-12-28 16:05:22 Do you have a call opcode, or just execute xt address cells when you come to them? 2022-12-28 16:06:58 Sounds like you're set up a lot like I'm doing, only packing in bytes because that's what a processor can unpack cheaply. 2022-12-28 16:21:12 ACTION mostly reads the manual using the block editor directly. Just \n \r \t \b should be sufficient. 2022-12-28 16:21:33 KipIngram: there's a call instruction; it takes an address from the stack 2022-12-28 16:23:48 The forth is basically a direct threaded thing, with an optional assembler for writing new primitives or for things needing more performance. 2022-12-28 16:24:46 Are you allowed to define machine-specific opcodes then? 2022-12-28 16:28:09 I have experimented with doing that 2022-12-28 16:28:58 on memory use: reducing the size of the string/array pool by half (so just four entries) and leaving out the reserved-for-future-use parts of the buffers would reduce the memory footprint of the buffers to under 1,780 cells. (Of this, 1025 are for the block buffer) 2022-12-28 16:29:03 This would give a minimal memory footprint at just under 5,600 cells for a base system. 2022-12-28 16:31:07 Dropping the block editor & block buffer would save ~2000 cells, but I'd be reluctant to do that. 2022-12-28 16:33:24 But what good is a block editor / block buffer without a hard drive? 2022-12-28 16:33:53 You'd need to 'write' to a section of RAM, before writing out to tape 2022-12-28 16:37:03 running on a tape based system would be different; it's not something I've done thus far 2022-12-28 16:38:21 the "bundle" concept is interesting. 2022-12-28 16:38:41 especially with things that look ahead like the literal push. 2022-12-28 16:40:09 It seems to inherently make sense with a word-addressed system 2022-12-28 16:40:35 definiitely. 2022-12-28 16:41:00 it's also inherently useful if you want to shortcut certain operation patterns. 2022-12-28 16:41:47 instead of performing, say, 4 operations individually, if you know that some integer value corresponds to a small program, you can just write a switch/case to handle that. 2022-12-28 16:42:57 a bunch of loads from memory for example can be done in a single pass. 2022-12-28 16:43:17 or a "load load add store" can be done in a single pass. 2022-12-28 16:43:50 it'll save you a shot through the inner switch for individual ops. 2022-12-28 16:44:15 my son has a branch of the vm where he does that 2022-12-28 16:44:30 he's a good son. ;) 2022-12-28 16:46:49 That's cool that you have a son that is into the same kind of programming, and programming at all 2022-12-28 16:47:01 I hope if I have kids at least one of them is into programming lol 2022-12-28 16:47:16 Bonus if they're into Forth 2022-12-28 16:54:27 I have four boys; both of the older two can program, my second oldest uses Forth. The younger two aren't interested in programming yet 2022-12-28 16:54:55 Nice 2022-12-28 16:59:07 GNU sed is really not that optimised is it 2022-12-28 16:59:28 It's good for some things and really dumb for others 2022-12-28 16:59:49 someone wrote a lisp in sed. I guess that's a benchmark.\ 2022-12-28 16:59:59 time for someone to write a forth in sed. 2022-12-28 17:06:51 Time for someone to write a sed in forth more like 2022-12-28 17:08:44 or benchmark sed against a sh while read line loop for lols 2022-12-28 17:08:50 crc: You require a C99 compiler, not C89, because you don't return from main 2022-12-28 17:09:18 If you return 0 from main at the end it would be compatible with C89 as well (although this will only affect exit code so not a big deal) 2022-12-28 17:23:07 veltas: I'll fix that tonight 2022-12-28 17:24:22 Also if C23 is accepted with the decision to remove K&R-style functions then I think you will have to put explicit void or V in parentheses for function definitions with no arguments 2022-12-28 17:25:21 Although they could have changed the syntax to allow no 'void', I don't actually know 2022-12-28 17:27:57 I'm asking #c because I don't know 2022-12-28 17:33:24 Okay in C23 it will actually be prototyped, so no syntax change required 2022-12-28 17:33:35 This is good because the alternative was going to break a lot of code! 2022-12-28 17:50:25 If I was targeting C23, I'd just do a separate source file for that. 2022-12-28 18:01:16 I have five daughters, but none of them into Forth, and actually none of them SERIOUSLY into programming. 2022-12-28 18:01:33 Not really. They've been exposed to some special purpose stuff like R at school. 2022-12-28 18:03:41 decay: What do you mean by "bundle concept"? 2022-12-28 18:04:44 KipIngram: multiple opcodes stored in a word. 2022-12-28 18:04:53 Oh, ok. 2022-12-28 18:06:26 in crc's interpreter above, he assumes a 32 bit int and extracts the opcodes byte by byte. 2022-12-28 18:08:21 you could shortcut the body of the interpreter by comparing the whole word to a list of known "bundles"/sub-programs. 2022-12-28 18:08:49 and falling back to manual extraction/execution if you don't recognize any. 2022-12-28 18:09:03 Not sure that would even be faster 2022-12-28 18:09:25 Also you do know the VM is under 200 lines 2022-12-28 18:09:28 sure it would. it'd just be a switch that dives into another switch. 2022-12-28 18:09:35 by default. 2022-12-28 18:10:18 It's not really worth it if it makes the VM more complicated, I think its simplicity is highly valuable 2022-12-28 18:11:23 it won't make it more complicated, you already have the whole bundle available in `process_bundle`. just do `switch(opcode){case : ... break; case : ... break; default: ; }` 2022-12-28 18:12:04 arland tells me having the vm recognize & shortcut at least some bundles is faster, at least under the python implementation. I think he's still profiling things to see which specific ones actually help there. 2022-12-28 18:12:43 multi-op operations like "load operands from memory, do op, then store" would benefit. 2022-12-28 18:14:36 hmm I have some sort of noecho echo and cbreak 2022-12-28 18:15:07 I assume ctrl and alt key numbers that I receive need to be masked with something xD 2022-12-28 18:15:20 Yes, I see what you mean. 2022-12-28 18:15:42 [ r cbreak get.char ord .s recurse ] I have this loop so it prints the numeric value of a key 2022-12-28 18:15:50 still cbreak could go outside xD 2022-12-28 18:15:58 what I see also is esc is 27 2022-12-28 18:16:13 but the arrow keys seem to send 3 keys 2022-12-28 18:16:21 which the first is 27 2022-12-28 18:16:34 [27] [91] [65] 2022-12-28 18:16:55 oh.. they're sending esc [ A 2022-12-28 18:17:18 the same you give to the terminal to move the cursor 2022-12-28 18:19:18 still it's sad I have to peek to see if it's esc or an arrow key 2022-12-28 18:19:48 Of course, anywhere such a bundle would help, you could recognize that yourself, use the assembler to write a word that did those operations, and just call that word. 2022-12-28 18:20:14 So it wouldn't really add anything you couldn't otherwise do - it would just make the compiler more complex. 2022-12-28 18:20:56 the compiler isn't involved with that, it'd be the interpreter that's responsible for detecting those patterns in the instruction stream. 2022-12-28 18:20:57 Basically, you'd be creating a "synonym" for the op bundle that was of a form your system could already process without change. 2022-12-28 18:21:08 without the overhead of the call. 2022-12-28 18:21:31 Well, it would still have to somehow call the code that implemented the sequence, right? 2022-12-28 18:21:55 Or would it just be inline? 2022-12-28 18:22:01 Code threaded system? 2022-12-28 18:22:03 inline to the inner interpreter loop. 2022-12-28 18:22:59 So bear with me. Say you have words A, B, and (words you might type into your code to be interpreted), and say they do machine code segments a, b, c. 2022-12-28 18:23:17 If you recognized that as a bundle, somewhere you'd have to have a code segment abc, wouldn't you? 2022-12-28 18:23:34 Where does that segment come from in the first place? 2022-12-28 18:25:01 the assumption is that your system (crc's interpreter) is word-addressible, so your instruction pointer falls along word boundaries. means that all of your code is packed into word-sized bundles. so if you had an assembler for that platform, and all of your instructions are byte-sized, you can pack those instructions into a single word. 2022-12-28 18:25:23 I love it when people find out how complicated escapes in input are 2022-12-28 18:25:30 Sure. 2022-12-28 18:25:36 "Why does vim hang for seconds on end when I press this button?" 2022-12-28 18:25:46 because your system is word-addressible, and deals with words in general, you can compare the contents of memory at the instruction pointer with a set of known constants to recognize sequences of opcodes that are common. 2022-12-28 18:25:55 "Oh they're waiting for the escape sequence timeout" 2022-12-28 18:26:03 I guess when I think of "the interpreter," I think of something that is parsing your input and based upon it is calling pre-existing bits of code, to execute those operations. 2022-12-28 18:26:19 no, this'd be underneath you, as a feature of the processor. 2022-12-28 18:26:25 we're not talking forth, we're talking a VM. 2022-12-28 18:26:35 the forth goes on top of the VM. 2022-12-28 18:27:20 I actually wonder how padding works with this scheme. obviously not all programs fall into word-addressible lengths. 2022-12-28 18:27:31 I think in my system this is what would happen anyway - if I type a sequence of five words that correspond to machine instructions, starting on a cell boundary, that's exactly what it will do - it will compile those five instructions one after another and they will wind up packed into a word. 2022-12-28 18:27:39 Well, when I *compile*, that is. 2022-12-28 18:27:47 right. same goes here. 2022-12-28 18:27:48 The *interpreter* will still execute them one at a time. 2022-12-28 18:28:03 ls 2022-12-28 18:28:14 But we don't expect the interpreter to give maximum performance. 2022-12-28 18:28:50 You're talking about looking at a sequence of words, and if a whole sequence of them are machine instructions, have a table of common machine instruction sequences we could execute/ 2022-12-28 18:29:01 the VM in this case can handle single operations that are spread over multiple opcodes. "load op store" and "load load op store" are some examples of common operations. 2022-12-28 18:29:14 It would parse out all of them and then execute the whole set in one shot? 2022-12-28 18:29:19 no. 2022-12-28 18:29:20 http://fossils.retroforth.org:8000/ilo/file?ci=trunk&name=ilo.c&ln=166 2022-12-28 18:29:22 this line. 2022-12-28 18:29:24 underneath this line. 2022-12-28 18:29:48 have a switch on opcode. each case is a constant representing a sequence of opcodes of a known subprogram. 2022-12-28 18:30:07 common opcode sequences get executed faster because they don't have to go through inner switch one at a time. 2022-12-28 18:30:14 nothing to do with forth. 2022-12-28 18:30:18 How do you decide when to do that? 2022-12-28 18:30:40 The interpreter will have to recognize a whole sequence of words, won't it? 2022-12-28 18:31:06 you'd only be optimizing for common sequences of opcodes. 2022-12-28 18:31:13 in this case, process_bundle() would recognize a cell as matching a particular combination and running a function that does the various steps 2022-12-28 18:31:24 Sure. But you'd have to LOOK for those sequences when scanning the source. 2022-12-28 18:31:32 you don't scan the source..? 2022-12-28 18:31:39 You wouldn't be doing the normal space-delimited word matching anymore? 2022-12-28 18:31:43 this is not forth! 2022-12-28 18:31:48 it's just a bytecode VM! 2022-12-28 18:31:51 there is no parsing! 2022-12-28 18:32:02 there are four lights! wait, wrong context. 2022-12-28 18:32:06 Ok, I think I see now. 2022-12-28 18:32:16 e.g., for a `lit / call` sequence, it could bypass copying the address to the data stack before doing the branch 2022-12-28 18:32:43 I could see maybe using a tree structure; you'd look at the first opcode and see if you had a tree root for it. If not, process it. 2022-12-28 18:33:13 If yes, look at the next opcode and see if you have a child of that root for that byte. if not, execute the root and promote the child to root. 2022-12-28 18:33:15 Etc. 2022-12-28 18:33:33 exploit the fact that you can extract and compare constants without having to check individual bytes. 2022-12-28 18:33:41 You could go as far down as you kept matching, and whenever you rran out, you'd execute the longest sequence you'd found and promote the mismatch to root. 2022-12-28 18:33:47 That might actually be rather efficient. 2022-12-28 18:34:43 You'd look ahead until you picked up a byte for which you had no leaf availble. 2022-12-28 18:35:07 in http://fossils.retroforth.org:8000/ilo/file?ci=tip&name=vm%2Filo.py&ln=218 (python), arland implements a direct handler for the 'lit/call/nop/nop' bundle 2022-12-28 18:35:33 exactly like that. 2022-12-28 18:35:43 bypassing the actual split/execute of the actual instructions 2022-12-28 18:36:06 this makes the vm faster, but longer & more complex 2022-12-28 18:36:41 (he does this in the python, as python is slow enough for things like this to make a noticable difference) 2022-12-28 18:36:57 Right. I get it now. It just took a couple of pokes to get me from "Forth interpreter" to "bytecode interpreter." Now I understand why you would CARE. 2022-12-28 18:37:06 hehe. 2022-12-28 18:37:15 felt like I was speaking french for a moment while everybody else was speaking english. 2022-12-28 18:37:37 Well, I think I came in late and I was the odd man out. 2022-12-28 18:38:14 Yeah I was going to say, the fact it works in Python has no bearing on C 2022-12-28 18:38:15 Now, what I said holds to, though, right? if you recognized that sequence yourself and defined a word for it, and used it, then instead of that bundle you'd have an address and you'd get the same performance benefit? 2022-12-28 18:38:51 not really, because underneath that you'd still need to execute the opcodes one by one. 2022-12-28 18:38:52 Except that doesn't really work for lit/call/nop/nop, because you'd need to know in advance what the lit was. 2022-12-28 18:39:09 (this is assuming you're building a forth on top of the bytecode VM). 2022-12-28 18:39:22 Ok. 2022-12-28 18:39:28 (if you're building a native forth, yeah, it'd have the same perf. benefit.) 2022-12-28 18:39:48 (same concept, different areas of use.) 2022-12-28 18:39:48 I can't imagine a situation I'd want to run a bytecode VM *in python* which is itself a slow interpreted language 2022-12-28 18:40:00 veltas: maybe as a sandboxed language for a game. 2022-12-28 18:40:09 Well 'slow', 'medium' because bash exists 2022-12-28 18:40:41 Well, I implemented a printf functionality thing in my last Forth that used a bytecode interpreter. 2022-12-28 18:40:48 bash makes tcl look like assembler 2022-12-28 18:40:52 The format fields were basically "executed." 2022-12-28 18:46:20 hmm? how does bash make tcl look like assembler? I understand how tcl works and have written code in it 2022-12-28 18:48:45 Zarutian_iPad: performance wise 2022-12-28 18:49:11 I was saying Python is slow, and then I corrected myself and said actually it's 'medium' because bash exists 2022-12-28 18:49:31 And performance-wise, bash makes tcl look like assembler 2022-12-28 18:50:44 The python and lua implementations are the slow ones 2022-12-28 18:54:02 Have you tried luajit? 2022-12-28 18:55:31 The Lua implementation could be improved even in vanilla 5.4, you're using floating-point division where you want integer division for instance, and %256 instead of &0xFF etc 2022-12-28 18:55:54 New Lua has `//` for integer divide, and bitwise operators 2022-12-28 18:56:03 Luajit has library equivalents of these 2022-12-28 18:56:39 I'm not very familiar with Lua, I'll look at luajit 2022-12-28 18:57:20 Well not sure if 0xFF is allowed in Lua but 255 is 2022-12-28 18:57:48 Lua's one of my favourite programming languages, not used it much in years though 2022-12-28 18:58:05 I considered the idea of an emulator for what I'm working on, but the goal there sure wouldn't be top performance. Unpacking my six bit instructions would be a pain. 2022-12-28 18:58:15 It's my favourite 'scripting language' if we don't consider Forth a scripting language 2022-12-28 18:58:37 I'm unfamiliar with it - I'll also take a look. 2022-12-28 18:59:56 awk is my other favourite scripting language, and probably more up your alley 2022-12-28 19:00:55 awk is pretty much just a tiny scripting language with C's syntax and some cool builtin features, impractical for larger projects 2022-12-28 19:01:17 But it impacted every scripting language after it 2022-12-28 19:01:33 Hey. 2022-12-28 19:01:36 Heh, rather. 2022-12-28 19:01:40 From Wikipedia: 2022-12-28 19:01:42 As a result, the base language is light—the full reference interpreter is only about 247 kB compiled[4]—and easily adaptable to a broad range of applications. 2022-12-28 19:01:49 247kB is "light"? 2022-12-28 19:01:58 The person who wrote that must be unfamiliar with Forth. 2022-12-28 19:02:03 It's gotten much bigger over the years 2022-12-28 19:02:09 compared to other scripting languages... 2022-12-28 19:02:17 I think that has been updated without serious consideration 2022-12-28 19:02:46 On the other hand, 247 kB IS light in the context of our current computers. 2022-12-28 19:02:52 Lua used to be reasonably small 2022-12-28 19:03:16 That's not even honest, I think the current interpreter is like 500KB or more 2022-12-28 19:03:42 If you build it statically, which is the only reasonable criteria since we don't really care about these sizes on desktop, only in embedded 2022-12-28 19:03:45 because people want integer support or Unicode or other such crazy things 2022-12-28 19:04:01 Oh, the Lua while structure is kind of nice - allowing a whole block of code for computing the condition. 2022-12-28 19:04:35 I hate Lua's syntax actually, it's my biggest issue with it 2022-12-28 19:04:43 Oh, wait - I misread what I was just speaking of. 2022-12-28 19:05:14 1-based indexing, the ~= operator means != , and also bipolar "while" vs "until" ... I shouldn't have to negate a condition because I went from while to do-while 2022-12-28 19:05:29 "end end end end" 2022-12-28 19:05:43 But the semantics of Lua are really lovely 2022-12-28 19:07:53 The ugly truth is that JavaScript is not that far off of a Lua with better syntax. It's not quite as nice as Lua's model but it's on everyone's computers already and optimised by thousands of people who are paid to be there 2022-12-28 19:09:34 ... all you wanted was the scripting language but it came with Google's Gorilla And Forest Friends 2022-12-28 19:09:36 JS is just a popular Lisp. 2022-12-28 19:10:59 I'm not sure how to think that other than it's a known thing Brendan Eich wanted to do a lisp and tried to apply lisp logic 2022-12-28 19:11:13 that's where I was going. :) 2022-12-28 19:11:18 But it's really not like lisp at all, it's just yet another c-style scripting language 2022-12-28 19:11:24 There's nothing lispy about it at all 2022-12-28 19:11:33 somehow they managed to screw up LET and create a temporal dead zone in JS 2022-12-28 19:11:41 "JS is just a popular Lisp" was a shitpost in the early aughts I think. 2022-12-28 19:12:05 Brendan obviously preferred old-school declarations 2022-12-28 19:12:09 Lua? well as Lisp uses lists for everything, Lua uses tables for nearly everything 2022-12-28 19:12:10 I do too 2022-12-28 19:13:12 Which makes Lua far superior because lists suck 2022-12-28 19:13:38 what's the difference. /s 2022-12-28 19:14:07 Lisp programmers laugh at C programmers for using linked lists occasionally and then it turns out everything in their language is infinitely slower than that somehow 2022-12-28 19:14:22 They have faster datastructures and nobody uses them because they're not as 'elegant' 2022-12-28 19:14:36 huh? I use structs and whatnot in lisp 2022-12-28 19:15:27 Why a high level language even has stuff like association lists that aren't tables is beyond me, they're everywhere in lisp code 2022-12-28 19:15:43 what is a "table", exactly. 2022-12-28 19:15:52 A hash table 2022-12-28 19:15:53 I thought it was just a nested associative array. 2022-12-28 19:15:55 lol. 2022-12-28 19:16:07 you'd really like JS then.. 2022-12-28 19:16:19 Yeah I do compared to lisp lol 2022-12-28 19:18:10 Hash tables are great as long as you have memory to burn. 2022-12-28 19:18:11 It's not like when C++ programmers have to use std::unordered_map on every map with like 4 items when they could have used an array 2022-12-28 19:18:23 Just make it big enough that you have almost no collisions. 2022-12-28 19:20:19 There's a whole science to hash tables nobody appreciates, like did you know a pseudorandom hash is actually unoptimal? You can do way better if you actually understand your data 2022-12-28 19:20:48 Yeah, they're very "mathematical." 2022-12-28 19:22:09 If I store keys in range 4000 to 4010 by just having a size 11 array and subtracting 4000 from the index I've got a perfect spread, no holes. But I had to *think*, awful. 2022-12-28 19:23:04 Right - exactly right. But it's not just "understanding" your data, is having nice data / controlling your data too. 2022-12-28 19:23:12 You couldn't do that with just any 11 random numbers. 2022-12-28 19:23:15 Not as easily. 2022-12-28 19:23:32 What you're really exploiting there is the structured organization of the data itself. 2022-12-28 19:23:45 If you always scrutinise and simplify requirements, it's quite often the case your keys aren't entirely random 2022-12-28 19:23:46 yeah, there's not a general solution to it. 2022-12-28 19:24:06 largely because your key space varies per problem. 2022-12-28 19:24:13 Yeah - you might not "often" be able to do as well as your example there, but you're exactly right - it's rarely "as bad as random." 2022-12-28 19:24:28 There's usually SOME information content in the structure of your data you can exploit. 2022-12-28 19:24:46 Which is why English text compresses by around 3:1, too. 2022-12-28 19:24:52 Same kind of idea. 2022-12-28 19:25:23 English isn't random. 2022-12-28 19:25:40