2023-05-05 00:01:12 unjust: what are you planning to do with that forth in js? 2023-05-05 00:01:54 decay: don't you have your own forth currently? 2023-05-05 00:02:02 Not really. 2023-05-05 00:02:09 and not willing to? 2023-05-05 00:02:21 I have many languages. I haven't found a favorite. 2023-05-05 00:02:40 I enjoy a lot making a toy language 2023-05-05 00:03:33 I'm trying to find something I can just settle down in for a few years. 2023-05-05 00:03:43 and being able to implement my own language in any other language is a cool feature 2023-05-05 00:03:51 I'm having... considerable difficulty. 2023-05-05 00:03:54 I can steal a lot of stuff 2023-05-05 00:05:39 for php makes more sense to make a transpiler 2023-05-05 00:05:42 vms14: not sure, draw on a canvas probably 2023-05-05 00:05:52 but I don't like it 2023-05-05 00:05:53 vms14: maybe generate svg 2023-05-05 00:06:08 unjust: I was going to start adding canvas bindings 2023-05-05 00:06:22 my goal is to make a simple game 2023-05-05 00:07:01 but wanted to make a todo app, as it's like the next program after a hello world 2023-05-05 00:08:59 I think my favourite lang has become my own language 2023-05-05 00:13:25 a few years ago, part of a customer information display system that i built had an in-browser feature that used a canvas element (plus javascript) to allow a user to generate an overlay definition (featuring dynamic text + graphics sourced from other data providers, in JSON) on top of a JPEG/PNG image 2023-05-05 00:13:56 it had a nice grid + vert/horiz line guidance system with auto-snapping 2023-05-05 00:14:26 i would like to recreate something similar without javascript getting in the way this time 2023-05-05 00:15:06 :0 2023-05-05 00:15:12 I want to see the code when you do it 2023-05-05 00:15:18 the code in your forth 2023-05-05 00:17:04 sure, if i ever get around to that, i'll show you 2023-05-05 00:17:10 :D 2023-05-05 00:21:02 decay: wouldn't a language implemented by you be enough to settle on? 2023-05-05 00:21:26 for example if you do like crc, making a vm, it would be quite portable 2023-05-05 00:21:35 not with that greener grass being right over there 2023-05-05 00:21:44 which means you can benefit from almost every language 2023-05-05 00:22:13 yeah and it's almost summer, there's a lot of bugs 2023-05-05 00:23:06 vms14: I've implemented everything from tiny concatenative languages, to term rewriting languages that are suitable for hardware synthesis. 2023-05-05 00:23:37 so you can't decide on which kind of language to implement? 2023-05-05 00:23:56 There are caveats with all of them. But I have a model that I'm happy with, I just need to put a method of programming into it. 2023-05-05 00:24:27 what are the caveats of concat langs more than stack juggling? 2023-05-05 00:24:54 I miss optional arguments, it forces me to use lists, but it's doable 2023-05-05 00:25:06 My requirements are pretty strict, in that things as basic as numbers aren't included. 2023-05-05 00:25:48 do you have documentation on some of those langs? 2023-05-05 00:26:12 The caveats of concatenative languages is that you have two flavors: you have languages like Joy, and languages like Forth. Forth always assumes some underlying machinery, and things need to "look" like a traditional von Neumann machine. 2023-05-05 00:26:39 Joy is more symbolic, but you need quotations, which are more complex than you'd think. 
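A minimal sketch of the quotation idea in plain ANS Forth rather than Joy: :noname leaves an execution token that can be stored, passed around, and run later with EXECUTE. The names square and twice below are just made up for the example.

    :noname dup * ;  constant square      \ a "quotation" that squares the top of the stack
    : twice ( n xt -- n' )  dup >r execute r> execute ;   \ apply a quotation twice
    5 square execute .     \ prints 25
    3 square twice .       \ prints 81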
2023-05-05 00:27:07 So you have this duality between wanting to be symbolic, far away from a specific machine architecture, and being very close to a specific machine architecture. 2023-05-05 00:27:40 There's a gradient between the two. 2023-05-05 00:28:17 The same can be said for most languages/models of computation. 2023-05-05 00:28:37 my lang does not even know what the machine is 2023-05-05 00:28:49 You either have machinery following an instruction graph or symbolic manipulation that warrants reduction steps. 2023-05-05 00:29:02 I have not been happy with either. 2023-05-05 00:31:24 I think I'm getting closer to what I want, though. I have a model that functions as a model of concurrency, a model of capabilities (security-wise) and a method of modeling mobile agents. 2023-05-05 00:31:43 I just don't have a programming method for the agents. 2023-05-05 00:31:59 decay: That's really interesting. In a way, we could view both term rewriting and assembly as languages implemented in their respective metalogics. 2023-05-05 00:32:32 xelxebar: One's mind starts focusing on the modeling overhead of each method. 2023-05-05 00:32:48 And it really comes out in something like sending a running program over the network. 2023-05-05 00:33:11 If I wanted to lift a running program from my machine to another machine, _while it's running_, I need to transmit three things. 2023-05-05 00:33:34 The memory that the program is using, the program memory that the program code exists in, and the program counter. 2023-05-05 00:34:16 That's assuming a traditional VM. But, what happens if I change the model? Let's assume I have a bunch of term rewrite rules, with one large term as my program state. 2023-05-05 00:34:38 The number of things I need to transmit, now, is just 2: the list of term rewrite rules, and the term under rewrite. 2023-05-05 00:34:52 The state of the "program" is just the current term. 2023-05-05 00:35:56 What if you go further, to something like Joy? I could send a quotation across a network after composing it with its required arguments, so that when it's evaluated, it "unpacks" itself and starts running. All of a sudden, the number of things I need to transmit drops to 1: the code I want to run. 2023-05-05 00:36:14 The number of conceptual objects matters. 2023-05-05 00:36:38 yeah, but somehow giving the memory feels more powerful 2023-05-05 00:36:55 it could send a program while it's running 2023-05-05 00:37:02 The latter two do the same thing. 2023-05-05 00:37:10 stopping before send, and resuming once it arrives on the other machine 2023-05-05 00:37:17 That's.. what all of them do. 2023-05-05 00:37:45 giving code is not the same as giving the state of the whole program 2023-05-05 00:37:46 There are likely clear mechanical transformations between the different models, but I gather you're focusing more on the "conceptual weight". Is that true? 2023-05-05 00:38:10 vms14: I'm not giving code. I'm giving code + state. 2023-05-05 00:38:23 It turns out that in the latter two, you can start merging them. 2023-05-05 00:38:33 then it should be the same as with memory; the code is part of the state itself 2023-05-05 00:38:33 xelxebar: Yeah. Conceptual weight would be a good way to put it. 2023-05-05 00:38:46 Because conceptual weight translates into implementation weight. 2023-05-05 00:40:07 Interesting. How base conceptual models impact downstream maintenance and dev costs is something I've been starting to cogitate concretely about lately. 2023-05-05 00:40:49 It becomes pretty expensive.
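The "compose a quotation with its required arguments" step above can be sketched in Forth as runtime code generation; this is a hedged sketch relying on gforth-style behaviour (building a :noname definition at run time), and curry is an invented name, not a standard word.

    : curry ( x xt -- xt' )
      >r >r                   \ stash xt, then x, on the return stack
      :noname                 \ start a nameless definition (enters compile state)
      r> postpone literal     \ compile "push x" into it
      r> compile,             \ compile a call to the original xt
      postpone ; ;            \ finish it, leaving the new xt' on the stack

    4 ' . curry execute       \ prints 4: the argument travels inside the quotation

The resulting xt' is the single object you would conceptually ship to the other machine and EXECUTE there.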
2023-05-05 00:41:12 Even something as trivial as word size in a VM impacts things like algorithm complexity unless you stick with high word sizes. 2023-05-05 00:41:28 we are rich in resources 2023-05-05 00:41:38 is the only way I can be rich 2023-05-05 00:41:41 We are gluttonous with resources and cannot plan for usage. 2023-05-05 00:41:56 I want to know the concrete memory footprint of 20,000 agents, for example. 2023-05-05 00:42:13 But I can't do that if the internal model of those agents includes dynamic memory allocation (implicit or explicit). 2023-05-05 00:42:33 agents as in the actor model? 2023-05-05 00:42:39 No. 2023-05-05 00:43:00 Running pieces of code that can interact with some surrounding environment via synchronous communication methods. 2023-05-05 00:44:08 APL is actually pretty nice in that regard. It affords static analysis of exact memory footprint down to the byte. 2023-05-05 00:45:49 Not just theoretically, even. It's something APLers actually end up doing in practice. I still don't grok the precise design decisions of APL as a programming language that manifest such a thing, though. 2023-05-05 00:47:07 Resource planning also allows you to do things like run untrusted code. 2023-05-05 00:47:12 From arbitrary sources. 2023-05-05 00:47:21 If you know an agent takes, say, 32 bytes. 2023-05-05 00:47:38 You can say that an assemblage of them coming from some network location is only allowed to be 300 agents wide. 2023-05-05 00:52:26 Trust models are certainly important, but that seems like a pretty orthogonal issue. Just as mutual cooperation in the prisoner's dilemma leads to better outcomes than defecting, defaulting to adversarial models incurs pretty high costs. 2023-05-05 00:53:56 In this case, giving hard upper bounds on memory usage is a much weaker guarantee than *also* having tight lower bounds as well. 2023-05-05 00:54:55 Why? 2023-05-05 00:55:29 The assumption that any code that runs on your machine will be malicious isn't exactly something divorced from reality. 2023-05-05 00:55:39 Especially if you accept it from any source. 2023-05-05 01:08:59 Depends on what you're doing. I'm imagining that you're designing a language with concurrent agents. Building in adversarial semantics means that I have to pay the cost of guardrails even in cases where I'm the one controlling all agents. 2023-05-05 01:09:56 Within-application agents usually go out of their way to cooperate, and we only need to cordon off specific ones, like those that handle user input or whatever. 2023-05-05 01:10:23 That is, unless your threat model is really hard, and you want to be extremely cautious :D 2023-05-05 01:12:13 Anyway, I'm just making a descriptive statement, not a normative one. Adversarial agents cost more overhead than cooperative ones. It makes sense to ask when and where you want to spend that cost. 2023-05-05 08:48:15 This is a great read about TCL https://yosefk.com/blog/i-cant-believe-im-praising-tcl.html 2023-05-05 08:48:39 And it's about how useful it can be with embedded work, and I think that a lot of what's praised here applies to Forth as well 2023-05-05 08:50:28 and later this https://yosefk.com/blog/my-history-with-forth-stack-machines.html 2023-05-05 08:50:30 also 2023-05-05 08:50:45 i think it was here that tclforth got linked, right?
2023-05-05 08:52:21 Well I seem to remember he doesn't like Forth 2023-05-05 08:53:08 My point in linking the article is just that it helps illuminate slightly why languages like Forth and Tcl have good syntax/features for something interactive 2023-05-05 08:53:26 aye 2023-05-05 08:55:12 Also they were wondering how to achieve something as 'nice' in an infix language, well Haskell didn't get mentioned. I think Haskell's syntax is a good effort at removing unnecessary commas and parentheses 2023-05-05 08:55:39 From a language with stronger syntax, rather than just shell-style "everything is a string" syntax 2023-05-05 08:58:40 drakonis: yes, i think someone linked to https://github.com/wejgaard/TclForth 2023-05-05 09:00:10 the Expect library for TCL is an underappreciated tool for automation 2023-05-05 09:01:04 the TclForth creator's Holon* projects are pretty interesting 2023-05-05 09:01:28 surprisingly smalltalk-ish 2023-05-05 09:30:05 TCL: what if a shell was bitten by a radioactive lisp 2023-05-05 09:31:23 Ugh. Controversy in the reddit community I help moderate. 2023-05-05 09:31:57 A couple of folks posted AI generated artwork, in the "Fan Art" category. They got reported for copyright violation, and now there's a big stew brewing about the issue in general. 2023-05-05 09:32:20 which one? 2023-05-05 09:32:30 On the grounds that those engines scrape existing artwork online, and thus violate the rights of the creators of that art. 2023-05-05 09:32:48 notice how the little people get the copyright violation notices, not le googles 2023-05-05 09:32:48 Midjourney on the first one. Different engine on the second. 2023-05-05 09:33:04 Yeah. 2023-05-05 09:33:24 Well, no one is arguing that punitive action should be taken against the people that produced the images. 2023-05-05 09:33:37 The critics just want us to ban AI art from the community. 2023-05-05 09:34:10 It's an issue I'd just never previously given any thought to. 2023-05-05 09:34:25 I actually didn't even know Midjourney existed until this came up. 2023-05-05 09:34:26 it's not /r/art, is it? 2023-05-05 09:34:48 No, it's /r/dresdenfiles. Aimed at a particular urban fantasy series. 2023-05-05 09:34:52 ah yes. 2023-05-05 09:35:18 AI art has interesting consequences such as ending the trade of being an artist 2023-05-05 09:35:25 It's not an activity I'd have expected to find myself engaged in before I started, but I've had a good time helping out. 2023-05-05 09:35:28 that's just an AI bot being a great artist, isn't it? stealing instead of copying? 2023-05-05 09:35:29 (it's a really bad consequence) 2023-05-05 09:35:39 Yeah, it's a thorny issue for sure. 2023-05-05 09:35:53 And is going to be a lot more widespread than just art in the long run. 2023-05-05 09:37:29 i doubt AI art is going to kill human (or other lifeform?) rendered art, there'll always be people who want to support other people in creative endeavours 2023-05-05 09:38:17 the consequence is that fewer people commission art because of it 2023-05-05 09:38:51 why pay someone when you can feed the AI what you want until you get the desirable results 2023-05-05 09:39:01 creative endeavours... how's that going... https://www.eff.org/deeplinks/2023/01/have-you-tried-turning-it-and-again-rethinking-tech-regulation-and-creative-labor#main-content 2023-05-05 09:39:19 the advent of photography had the same effect, it didn't kill off the portrait and landscape artists altogether 2023-05-05 09:39:38 Sure. PowerPoint did it to some fraction of the "graphic artist" biz.
2023-05-05 09:40:45 Lot of technology has had this sort of effect. Early in the industrial revolution people would sabotage machinery because they felt like it destroyed jobs (which it did). 2023-05-05 09:42:01 a) work 14 hour days in factory b) starve to death 2023-05-05 09:42:07 things evolve and people adapt, new vocations appear to fill the void inevitably 2023-05-05 09:44:49 yes, though you have horrid things like hollywood execs wanting to use chatgpt to generate scripts 2023-05-05 09:44:50 That's the hoped-for result, yes. I'm not sure it's an inexhaustible resource, though. 2023-05-05 09:45:05 because the writer's guild is on strike 2023-05-05 09:47:42 if they blindly rely on the output of artificial intelligence, i'd agree that's pretty terrible. but what if they use it as augmented intelligence instead and edit/extend/redact the generated script to their liking? 2023-05-05 09:47:42 the difference between the industrial revolution and AI is that AI replaces an inherently creative endeavor 2023-05-05 09:49:02 the machines sped up repetitive work during the industrial revolution 2023-05-05 09:50:21 it was less safe yet you still needed underpaid humans to operate the machines 2023-05-05 09:50:25 AI as it stands is just a fuzzy search over a corpus. It can't replace creative endeavors, only regurgitate and mix the end results of creativity and hand that mixture back to you. 2023-05-05 09:51:14 exactly that 2023-05-05 09:51:51 it is inherently dependent on existing human creations to output its data 2023-05-05 09:52:14 The only "surprise" is that the granularity got better. Take voice generation for example. 2023-05-05 09:52:41 Given a sentence, and a person's voice profile, generate an audio file of the person saying the sentence. 2023-05-05 09:52:42 if it somehow managed to make it impossible to live off art, it would have less data to work off 2023-05-05 09:52:44 Well, you might argue that the industrial revolution replaced "inherently skilled" labor. 2023-05-05 09:52:50 it did not 2023-05-05 09:52:56 I'm not sure the nature of the ability makes a lot of difference. 2023-05-05 09:53:29 At one point we relied on, essentially, sentence mixing by picking out words from the chunks of audio of that person. 2023-05-05 09:53:45 And stitching them together and applying postprocessing to them. 2023-05-05 09:53:50 Yeah, I've heard of people using AI to generate voices, which they then use to contact someone's parents and ask for emergency money to be sent. 2023-05-05 09:53:52 To make one word flow into the next. It sucks. 2023-05-05 09:53:59 My wife read about it in an article a week ago or so. 2023-05-05 09:54:36 Flash forward a couple of years, it got more granular: instead of vowels, you can now do it with phonemes. 2023-05-05 09:54:40 I figure we'll get to the point where photographic evidence is no good in court anymore. 2023-05-05 09:55:00 Courts have provenance, that hasn't changed. 2023-05-05 09:55:16 Chain of custody of evidence always will be a thing. The photographic evidence had to come from somewhere. 2023-05-05 09:55:40 just wait, someone will try and sell the idea of an AI judge + jury 2023-05-05 09:55:41 But, on voices, now we've gone from phonemes to sub-phoneme construction. The only thing that's improved over that span of time is granularity. 2023-05-05 09:55:42 Well, but take security camera footage or something. Yes, it's in a chain of custody after it's first secured. 2023-05-05 09:55:48 But there's no telling where it came from.
2023-05-05 09:56:42 unjust: At least one state is already using AI to determine criminal sentences. 2023-05-05 09:56:52 There are nuances that make that less likely. For one, security camera footage comes from security cameras. So there are audit logs, access logs, etc. 2023-05-05 09:57:05 don't need AI for that. poor? throw 'em in jail 2023-05-05 09:57:06 And when a defense attorney requested to study the source code, the request was denied. 2023-05-05 09:57:17 If someone had that level of control over a security system and had enough training data to fake footage, they wouldn't even need AI. 2023-05-05 09:57:20 KipIngram: sentence duration only? 2023-05-05 09:57:33 I don't know that detail. 2023-05-05 09:57:49 It was somewhere up in the farm belt - been a couple of years since I read about it. 2023-05-05 10:12:33 actually with AI you might get rid of the competent judges and replace them with kowtowing flunkies 2023-05-05 10:12:56 oh yes. 2023-05-05 10:13:00 lots of potential for abuse 2023-05-05 10:13:16 https://twitter.com/AlexBlechman/status/1457842724128833538 2023-05-05 10:13:25 an evergreen tweet 2023-05-05 10:14:06 It's the kind of thing I would have just never believed would even be considered earlier in my life. 2023-05-05 10:14:43 But new generations come along and just don't have the same "core principles." 2023-05-05 10:14:47 thrig: ...you mean like the thing that's already happening? :P 2023-05-05 10:15:45 shush! 2023-05-05 10:16:11 KipIngram: What do you mean by "core principles"? 2023-05-05 10:16:12 this is some, uh, hypothetical future dystopia we would never be so foolish to dabble with 2023-05-05 10:16:38 Or, rather, what principles are you talking about specifically. 2023-05-05 10:17:09 Um, well, just things that we'd fundamentally not want to give up, like the right to trial by jury of our peers. AIs wouldn't be our peers. 2023-05-05 10:17:39 I'm pretty sure nobody wants to be tried by an AI... 2023-05-05 10:17:55 Intergenerationally. 2023-05-05 10:17:59 Back some years ago I saw some girl in a news interview. No one in particular, just a "face on the street." And she actually said outright "This freedom of speech thing has got to go." 2023-05-05 10:18:08 I nearly fell out of my chair. 2023-05-05 10:18:16 And, how old was that girl. :P 2023-05-05 10:18:24 Oh, young, but voting age. 2023-05-05 10:18:29 Young as in.. 2023-05-05 10:18:37 That's what I mean about new generations coming along. 2023-05-05 10:18:44 Twenty? 2023-05-05 10:18:58 And some years ago would be, what, a decade? 5 years? 2023-05-05 10:19:04 6-7. 2023-05-05 10:19:14 Well, 7 years ago, I was 20. 2023-05-05 10:19:23 I'm pretty sure it's not a generational thing. 2023-05-05 10:20:08 Fair enough - I wouldn't want to imply that it's a universal thing. And I'm sure you could have found someone much earlier. But it was just a shocking thing to hear on the news. 2023-05-05 10:20:10 7 years ago was 2016, which was just the beginning of the social erosion of our country. 2023-05-05 10:20:13 Of course, "shock" is what they go for. 2023-05-05 10:20:49 That's a separate problem - the tendency of the news to focus on sensation instead of "best truth." 2023-05-05 10:20:57 If you ever want to understand Russia's actions on the world stage... https://en.wikipedia.org/wiki/Foundations_of_Geopolitics#Content 2023-05-05 10:21:05 There's always been yellow journalism. 2023-05-05 10:21:11 But these days it's practically all yellow. 2023-05-05 10:21:39 And I've watched that happen during my life.
2023-05-05 10:21:54 I'd call it green journalism, as in the color of money. And the Russians pay well. 2023-05-05 10:22:35 Anything profit driven will optimize itself for the largest market share. 2023-05-05 10:22:40 That's fair too - yellow is just a euphemism that's been around for a long time. 2023-05-05 10:22:49 But yeah, money is part of it too. 2023-05-05 10:23:01 I mean, they go yellow because they decide that makes more money. 2023-05-05 10:23:08 I'd say money is the only driving factor. Shock value means attackment. 2023-05-05 10:23:11 *attachment 2023-05-05 10:23:30 It taps into the cyclical expectation-reward system. 2023-05-05 10:23:44 I'm willing to believe that money is the ultimate root cause. 2023-05-05 10:23:59 If you can deliver shock value that's ideologically pleasing to the reader, you have a hooked market. 2023-05-05 10:24:03 Yellow journalism is a "behavior." And probably motivated by money. 2023-05-05 10:24:16 Hence, Fox, Newsmax... 2023-05-05 10:24:26 And so on. 2023-05-05 10:24:36 I don't know one that remains immune to this. 2023-05-05 10:24:42 No big one, at least. 2023-05-05 10:24:47 Agreed. 2023-05-05 10:25:01 My wife thinks the Wall Street Journal is better than most. 2023-05-05 10:25:10 But I haven't read it, so I don't have an opinion there. 2023-05-05 10:25:25 The fact that it's incentivized even at the lower levels, i.e on individual reporters to find a "scoop" (which is a trope that's been around for longer than either of us), is a problem. 2023-05-05 10:25:34 Yes. 2023-05-05 10:25:49 But that's a symptoms of seeking markets. 2023-05-05 10:25:53 *symptom 2023-05-05 10:25:59 fingers, brain, get together and figure it oout 2023-05-05 10:26:03 I think the internet and "deep conversation" podcasts and so on are our best option these days, but you can't consume just one. 2023-05-05 10:26:04 *out, damnit. 2023-05-05 10:26:12 You need to consume a bunch of them, across the spectrum. 2023-05-05 10:26:17 And then think about things yourself. 2023-05-05 10:26:34 I would really rather we just report things that happened rather than also offering accessory perspective. 2023-05-05 10:26:42 The Fairness Doctrine needs to come back. 2023-05-05 10:26:50 Would suit me. 2023-05-05 10:27:34 I did find a news website that was just data, graphs and so on 2023-05-05 10:27:37 When I was in college I made a point to watch the NBC news every evening. Just to keep up with the world. As the years went by that became a waste of time. 2023-05-05 10:27:56 What year? 2023-05-05 10:27:59 Or, rather, years. 2023-05-05 10:28:09 I've given thought to writing my own "AI" news scraper. 2023-05-05 10:28:20 I'm imagining you weren't in college during the Reagan years. 2023-05-05 10:28:22 Just to gather multiple sources up into one place. 2023-05-05 10:28:29 Or post-Reagan. 2023-05-05 10:28:39 I was, in fact. 2023-05-05 10:28:44 1981-1985. 2023-05-05 10:28:46 like https://ground.news ? 2023-05-05 10:28:53 ooooo just before it got revoked. 2023-05-05 10:29:15 1987. 2023-05-05 10:29:20 sorry for butting into the conversation 2023-05-05 10:29:26 That sounds about right. 2023-05-05 10:29:38 nmz: No problem, man, it's not like we're on topic or anything here. 2023-05-05 10:29:40 1440 is also a sort of aggregator or news scraper 2023-05-05 10:30:03 Part of that desire is just to try my hand at that kind of AI. 2023-05-05 10:30:09 "DIY" mentality. 2023-05-05 10:30:21 Plus I'd run it in console. 
2023-05-05 10:31:57 someone made a summary AI thing but they charge for the API 2023-05-05 10:45:59 would be great if you could slap [citation needed] on most of the news out there 2023-05-05 10:46:55 or ignore it, as it's mostly shocking and/or depressing 2023-05-05 10:47:37 i'd bet there'd be less garbage out there, if (for the most part) the journalists had to divulge the sources of their info 2023-05-05 10:48:54 Probably, but there are downsides of that too - it would be a tradeoff. 2023-05-05 10:57:25 So, I'm running a drive test at the moment that uses 2.85:1 compressible data - I write enough of it to fill the drive 50% full physically. 2023-05-05 10:57:37 Normally I format before such tests, but I forgot this time. 2023-05-05 10:57:56 It's not really a problem - the previous data on the drive also filled 50% of it and was 1:1; it will all get overwritten. 2023-05-05 10:58:07 But it is kind of interesting to watch the fill percentage as this happens. 2023-05-05 10:58:26 It goes down from 50% at first, as some of the 1:1 data is overwritten with 2.85:1 data. 2023-05-05 10:58:44 It will go down for a while and then start going back up, and will go back to 50% in the end. 2023-05-05 10:59:33 I've often wondered why we don't have drives that have hardware compression. 2023-05-05 10:59:52 This drive does. 2023-05-05 11:00:05 ... 2023-05-05 11:00:08 show me 2023-05-05 11:00:14 I know kung-fu! 2023-05-05 11:00:29 LOL. 2023-05-05 11:00:36 https://www.ibm.com/docs/en/flashsystem-9x00/8.2.x?topic=overview-flashcore-modules 2023-05-05 11:01:13 The biggest one holds 38.4 TB physical. You could put 100 TB or so of English text on it. Palm of your hand. 2023-05-05 11:01:47 Jesus. 2023-05-05 11:01:56 Yeah, it's pretty impressive. 2023-05-05 11:02:12 What kind of algorithm do they use? 2023-05-05 11:02:13 We don't sell it standalone - we put it in rackmount products that we build. 2023-05-05 11:02:19 You can get a box with 48 of them. 2023-05-05 11:02:24 2U rack box. 2023-05-05 11:02:26 I keep forgetting you work at IBM. 2023-05-05 11:02:46 Does it compress at rest or compress in-stream? 2023-05-05 11:02:55 On the fly. 2023-05-05 11:03:04 Damn. 2023-05-05 11:03:13 I'm even more interested in the algorithm. 2023-05-05 11:04:18 I don't "know it" to any extent. I can ask about it and see what aspects of it might be sharable. 2023-05-05 11:04:52 I'm even more impressed with the error detection and correction hardware, though. 2023-05-05 11:05:10 This QLC flash is crap - it develops bit errors over time just sitting there. 2023-05-05 11:05:39 But we have a background process running that scans through all the written data (takes a week or two to get to all of it), reads each page and counts the errors. 2023-05-05 11:05:54 When that rises above a certain threshold, we re-write that page to a new location. 2023-05-05 11:06:03 Sets the error count back to zero. 2023-05-05 11:06:16 We don't let pages accumulate more errors than the algorithm can correct. 2023-05-05 11:06:43 So you can't turn these boxes off and walk away for a month or anything like that - they need to be running all the time. 2023-05-05 11:07:19 Gross. 2023-05-05 11:07:20 When a page starts developing errors at too high a rate, we retire it. 2023-05-05 11:07:30 We're just moving to battery backed SRAM by the day. :P 2023-05-05 11:07:34 Yeah, sometimes I describe it by saying that we make armor out of tissue paper and spit. 2023-05-05 11:07:57 We start with this problematic flash substrate, and wind up with enterprise grade data storage. 
2023-05-05 11:08:09 actually paper/glue armor is pretty good 2023-05-05 11:08:13 It is actually fun work. Not that I actually do the development work. 2023-05-05 11:08:27 I can imagine! I wouldn't wanna use it, but the work itself sounds fun as hell. 2023-05-05 11:08:35 (or can be pretty good, if you do it right) 2023-05-05 11:08:44 I do performance testing, which involves enough "understanding" of the innards that I'm able to keep my developer palette wet enough. 2023-05-05 11:08:46 Most of the time. 2023-05-05 11:09:08 It's like all this stuff has its own laws of physics. 2023-05-05 11:09:25 It really does. 2023-05-05 11:09:43 I'd like to make my own flash-based storage box one of these days. 2023-05-05 11:10:19 It would never be something I could sell, though, because I'd build things into it that infringed, most likely. 2023-05-05 11:11:57 Anyway, I talk at times with the guy that does the error detection and correction FPGA. Really fancy stuff, involving polynomial fields and so on. 2023-05-05 11:12:26 Oh hell yeah. 2023-05-05 11:12:46 I'll definitely see if I can at least find out what general family of compression algorithms we draw from. 2023-05-05 11:12:54 I'm sure it "starts out" as something well known. 2023-05-05 11:13:09 With minor touches being added in, I'm sure. ;) 2023-05-05 11:13:24 Yeah, which I probably won't even get to know about. :-) 2023-05-05 11:13:29 The "secret sauce." 2023-05-05 11:13:56 I wonder, given the battery technologies we have now, if it'd be possible to build a battery-backed SRAM drive. 2023-05-05 11:14:03 We're not the fastest drive on the block. Samsung makes drives that outrun us by a fair margin. But they're much lower capacity. 2023-05-05 11:14:05 With similar capacities. 2023-05-05 11:14:25 There is some kind of drive that is basically that. 2023-05-05 11:14:35 I'm trying to think of the algorithm. 2023-05-05 11:14:56 And part of what our box level software does is let you mix a few of those in with the flash drives, and it runs a "tiered" storage system on all that. 2023-05-05 11:15:03 Keeping the hot data in the faster drives. 2023-05-05 11:15:10 Smart! 2023-05-05 11:15:27 That's mostly what these boxes are - a big bag of small smart things. 2023-05-05 11:16:02 Then some additional caching is done with ram directly in the box, rather than packaged as drives. Even faster. 2023-05-05 11:16:08 But still less capacity. 2023-05-05 11:16:24 I was wondering if there was a hot RAM cache. 2023-05-05 11:16:42 Yeah, but my work doesn't expose me to the box level. 2023-05-05 11:16:47 I do single-drive testing. 2023-05-05 11:17:46 So I put drives in a Linux host, and run open source storage testing tools. 2023-05-05 11:18:12 I don't use the Linux nvme driver, though - I replace it with a third party thing from Intel that has a lot less overhead. 2023-05-05 11:18:57 I'm not really interested in that overhead at all, so the lower I can make it the better. 2023-05-05 11:20:00 A few years ago there was a bit of a flap because my single-drive performance numbers were coming in 10% or so higher than the single drive performance "extracted" from system level tests. 2023-05-05 11:20:22 And some big shots started watching and wanted to know why we weren't getting my measured performance in the system.
2023-05-05 11:20:45 Turned out it's because I only run one CPU thread per PCI port - that thread doesn't have to contend for it. 2023-05-05 11:20:55 The system runs a whole bunch of threads, and they have to share. 2023-05-05 11:21:09 Once I did a test to emulate that setup, I measured the same thing they did. 2023-05-05 11:21:24 Bahahahaha. 2023-05-05 11:21:30 But the system is trying to run 48 drives, not just one. 2023-05-05 11:21:47 So it pretty much has to use all the cores and they have to share all the drives - there's really no way around it. 2023-05-05 11:22:24 Yeah, you're going to get some contention unless you straight up have a 48 core+ CPU.. 2023-05-05 11:22:43 But we at least came to understand the difference - that was good enough for the big shots. 2023-05-05 11:57:25 Is there a forth word to flush (drop) the entire stack? 2023-05-05 11:57:47 bye 2023-05-05 11:58:27 I mean without exiting 2023-05-05 11:58:33 details, details 2023-05-05 11:58:36 heh 2023-05-05 11:59:09 : empty BEGIN DEPTH 0> WHILE DROP REPEAT ; 2023-05-05 11:59:15 is what I have in my pforth.fs 2023-05-05 11:59:20 k, thanks 2023-05-05 12:04:52 there's CLEARSTACK too in some forths 2023-05-05 12:17:58 sp0 @ sp! 2023-05-05 12:18:24 If your Forth is written that way. 2023-05-05 12:20:11 ANS aside, there's a lot of unportability 2023-05-05 12:21:21 Possibly. But that's not as far out in left field as the number format stuff I was talking about yesterday, which I cooked up from scratch. 2023-05-05 12:21:42 Well, it was suggested to me by a friend. But I'd never seen it in a Forth previously. 2023-05-05 12:23:43 That stack stuff I have, and I've also seen it in some books. 2023-05-05 12:23:55 I think it may be of FIG origins. 2023-05-05 12:32:53 Anyway, it has the virtue of being the simplest way to do it. 2023-05-05 12:59:59 I'm torn over whether to store a thread's register values, when it's not running, on its data stack or in some system buffer. I want to think it doesn't really matter much either way. 2023-05-05 13:00:07 But I'm not quite sure of that yet. 2023-05-05 13:00:38 we are the registers of christmas past! 2023-05-05 13:02:18 If I keep them on the stack, then every thread's stack space will have to be sufficient to hold that, in addition to whatever the thread has on the stack. But that space has to be somewhere, so why not there? 2023-05-05 13:03:13 I suppose you could argue that the thread could use that space for itself, so long as it didn't let that condition persist across a thread swap. 2023-05-05 13:03:24 But having the thread be aware of that has its own yuck. 2023-05-05 13:04:18 Ok, my drive just finished overwriting all the 1:1 data - the fill state has started to move back up. 2023-05-05 13:04:43 It pulled down from 50% to 35.5%, and now has turned around and is headed back up about twice as fast. 2023-05-05 13:05:00 On the way down there was a one step up, two steps down thing going on, and now it's two steps up. 2023-05-05 13:05:47 Because the drive has two data connections, and one of those connections was laying down new data on unwritten resources right from the start - it was just the other one that was overwriting previously written 1:1 data. 2023-05-05 13:06:09 Took me a while to figure out when I expected it to turn around, but I did have the right answer before it turned. 
2023-05-05 13:18:25 I also want to give these threads a mechanism for communicating with one another - I'm thinking along the lines of those "synchronous communicating processes" laid out in that Rob Pike paper we discussed a few months ago. 2023-05-05 13:18:35 Have to also decide how to implement that. 2023-05-05 13:19:10 Simplest way may be to just endow message pipes with names, and let threads connect to them that way. 2023-05-05 13:19:44 But I want that mechanism to support not just sending and receiving information, but also thread synchronization. 2023-05-05 14:03:31 The simplest way to do this is, effectively, a lock with some additional metadata. 2023-05-05 14:03:59 Synchronous communication means one thread is stuck in `send` until another thread executes `receive`. 2023-05-05 14:04:43 `send` plops down a datum, `receive` grabs that datum and clears the comm channel. 2023-05-05 14:05:15 Yes. 2023-05-05 14:05:54 Pike's paper was good - it seems like a good model for parallelizing certain things. 2023-05-05 14:06:24 He definitely didn't invent synchronous comms, but came up with a calculus that used synchronous comms. 2023-05-05 14:08:31 Yeah, it just happened to be his paper I read. He didn't claim it was his idea - he was more just "promoting it." 2023-05-05 14:08:39 That's one of the things I really dislike about the "actor model": liveness assertions become really, really difficult to ascertain when you allow for asynchronous, unbounded comms. 2023-05-05 14:09:01 Whereas with synchronous comms you can effectively just use TLA or some other temporal logic. 2023-05-05 14:09:27 You always know the threads that'll be alive at some point, because they're just communicating state machines. 2023-05-05 14:10:27 So if they're in a state that they can only exit via some `send` or `receive`, you know they're dead unless another thread is in the same state, communicating over the same port. 2023-05-05 14:12:24 what kind of conventions do you follow on your word names? 2023-05-05 14:13:10 Yes, and the resources that you need for a synchronous pipe are lower. It's either got something in it or it doesn't. You don't need a big buffer like you'd need for asynchronous ones. 2023-05-05 14:13:14 for example I append a * to indicate an alternative version of a word, which usually operates in a different way or takes different arguments 2023-05-05 14:14:02 I append a ~ to indicate a version of a word that does not return some value 2023-05-05 14:14:04 A common relationship you see is foo and (foo) where (foo) winds up being called by foo and performs the real work except accepts an additional parameter. 2023-05-05 14:14:15 like a drop after the word 2023-05-05 14:14:41 Like we were talking about numeric base yesterday, and I mentioned (.) which is basically . but with a base parameter, so that you wind up with : . 10 (.) ; 2023-05-05 14:14:50 KipIngram: Exactly. And if you wanted to, you could chain together multiple synchronous pipes to form a queue, where the element at the end, on completion of a `send` to some output port (to the thing using the queued elements), executes a `receive` on a port from the prior element. 2023-05-05 14:15:11 Yeah. 2023-05-05 14:15:11 (This is very effective in asynchronous logic.) 2023-05-05 14:15:28 Asynchronous/clockless. 2023-05-05 14:15:43 I always liked asynchronous logic, but the old saw about it being easy to get yourself in trouble is certainly true. 2023-05-05 14:15:56 But sometimes it can solve problems that synchronous logic can't.
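A rough sketch of the "lock with some additional metadata" idea described above, written for a cooperative round-robin multitasker. It assumes a PAUSE word that yields to the next task (common in polyFORTH-style schedulers, not ANS standard) and a single sender and receiver per channel.

    : channel ( "name" -- )  create 0 , 0 , ;   \ a flag cell (0 = empty) plus a data cell

    : send ( x chan -- )
      begin dup @ while pause repeat    \ wait until the slot is empty
      2dup cell+ !                      \ deposit the datum
      1 over !                          \ mark the slot full
      begin dup @ while pause repeat    \ rendezvous: block until the receiver takes it
      2drop ;

    : receive ( chan -- x )
      begin dup @ 0= while pause repeat \ wait until a datum arrives
      dup cell+ @                       \ fetch it
      swap 0 swap ! ;                   \ clear the flag, releasing the sender

One task does `channel ch ... 42 ch send`, the other `ch receive`; because neither word returns until the rendezvous completes, the same mechanism doubles as thread synchronization.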
2023-05-05 14:16:12 I forget which class of async logic is TC but I don't think QDI is. 2023-05-05 14:17:22 : >+++:) ." I'm a skeletal fish, indicating something fishy here." ; 2023-05-05 14:17:23 hahaha 2023-05-05 14:17:28 https://www.taygeta.com/forth_intro/ellipses.htm 2023-05-05 14:18:02 https://github.com/ForthHub/discussion/issues/73 2023-05-05 14:22:30 perl -pne 's/foo/oh/ig' 2023-05-05 14:23:16 https://termbin.com/tbwhf 2023-05-05 14:27:22 http://forth.works/demo/ (now loading default image & blocks from server, blocks are cached in ram and can be downloaded/uploaded again) 2023-05-05 14:28:02 mixing -p and -n to perl doesn't make much sense 2023-05-05 14:29:36 yeah forgot p implies n 2023-05-05 14:30:11 not so much implication as they are the opposite of one another 2023-05-05 14:35:57 https://wiki.c2.com/?ForthObjects 2023-05-05 14:36:54 I assume most forthwrights despise objects, as they have a lot of ways to abstract stuff 2023-05-05 14:37:30 but I should add some syntax to create js objects 2023-05-05 14:37:43 as I see there, class could just create a package 2023-05-05 14:39:42 is there one oop system in forth you like? 2023-05-05 14:40:35 javascript required for "measured improvement in server performance" ... lol, wtf wiki.c2.com 2023-05-05 14:44:59 jgaz: The ANS forth word to drop the stack is ABORT , but that also quits out of any running function 2023-05-05 14:45:47 thrig's code looks like a safe way to achieve the same thing if you don't want to quit out of the currently running word 2023-05-05 14:57:46 crc I'm reading the manual 2023-05-05 14:57:48 it's cool 2023-05-05 15:00:42 crc how do you debug your programs in retro? 2023-05-05 15:01:01 when you have an error, what info does retro show you? 2023-05-05 15:01:49 ? 2023-05-05 15:03:54 https://www.reddit.com/r/Forth/comments/7hxgpb/debugging/ 2023-05-05 15:05:17 oh there's a crc comment there 2023-05-05 15:05:28 I do have plans for a flexible debugger, but probably won't actually write it until sometime next year. 2023-05-05 15:05:39 crc that comment is 5 years old 2023-05-05 15:05:42 where's the debugger? 2023-05-05 15:08:07 For RetroForth, it's mostly autopsy (examples/autopsy.retro) 2023-05-05 15:10:35 Next release of Retro will have more debugging support. It adds stack checks to the VM instructions, and I'm adding an i/o extension that will allow for running words on specific types of errors and getting more data on the location of errors. 2023-05-05 15:12:07 https://retroforth.org/examples/autopsy.retro.html 2023-05-05 15:12:47 does it make any sense to have unit tests in forth? 2023-05-05 15:13:35 yes? you can write bugs in your 3UP just as easily as in other languages 2023-05-05 15:16:35 The unu format allows for test blocks, though I've not done an actual testing vocabulary yet 2023-05-05 15:17:05 yeah, but it's hard to imagine a forth dev using testing suites when they have tested every line of code they type 2023-05-05 15:17:34 have they? 2023-05-05 15:17:50 They certainly have not. 2023-05-05 15:17:51 I assume a forth dev tests a word when it's defined 2023-05-05 15:18:02 That assumption isn't universal. 2023-05-05 15:18:28 and then you refactor and now what? where are the bugs you added? 2023-05-05 15:18:53 would you refactor a word and leave it untested? 2023-05-05 15:19:26 vms14: what if the words that the "tested" word depends on change their behaviour?
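For what it's worth, a unit test in Forth can be as small as an assertion word; this is a hedged sketch, not the established T{ ... -> ... }T harness (ttester.fs) normally used for the purpose.

    : assert= ( actual expected -- )
      2dup = if 2drop
      else cr ." FAIL: got " swap . ." expected " . then ;

    1 2 +  3 assert=      \ passes silently
    1 2 +  4 assert=      \ prints: FAIL: got 3 expected 4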
2023-05-05 15:19:34 it's silly not to have a test suite for easily tested things 2023-05-05 15:20:28 unjust: that only happens in my language, unless we talk about state 2023-05-05 15:21:47 https://users.ece.cmu.edu/~koopman/forth/koopman90a.pdf 2023-05-05 15:22:56 you can be a lot more confident about making changes and/or refactoring things when you have a decent test suite written for parts of your program, if it's written reasonably well you can just run it again once you've made any changes to confirm that you haven't broken anything 2023-05-05 15:23:23 well anything you considered important enough to write a test for, at least 2023-05-05 15:23:30 vms14: I test code as I write it, but having tests is still beneficial. 2023-05-05 15:24:40 I want to see in how many ways I can improve debugging and a simple test suite looks like the first step 2023-05-05 15:24:42 The (incomplete) tests I have written have found bugs introduced during later edits 2023-05-05 15:25:49 I'd also like to be able to tell the proper line of the source code when something happens 2023-05-05 15:26:16 but no idea on how to know the line of a loop 2023-05-05 15:26:54 well I need to add much more info to the interpreter 2023-05-05 15:28:06 when I make a program more complex than a hello world I have some troubles debugging 2023-05-05 15:29:05 I suppose I have to make extensive checking on all words, like checking the integrity of the values before using them 2023-05-05 15:30:09 for example words working with lists do not check if the element is a list 2023-05-05 15:32:28 an error/condition system is also interesting 2023-05-05 15:52:42 When Forth is interpreting, you have either typed a line of code at it or it is interpreting source code from a disk block. It knows its offset within that block at any given time; it advances a counter through there to move through the source. 2023-05-05 15:53:22 When an error occurs, it's not uncommon at all for Forth to tell you the block number and line number where the error popped up. If you fancy things up it can even open that block up and put the cursor on the word that it last ran. 2023-05-05 15:54:41 It's harder when executing a compiled word. *Normally* there is no information maintained at that level - it would have a large performance impact. So usually this kind of debugging information only gets you to the last thing *interpreted*, and where you are in the code that's triggered by that is something you then have to chase down by replicating the event and using debug prints or something to home in on the problem. 2023-05-05 15:55:52 I have last word and last atom executed and the error() function does tell its values 2023-05-05 15:56:08 also the error function when called can count lines until the source pointer to tell a line 2023-05-05 15:56:27 but usually that line has nothing to do with the place of the bug 2023-05-05 15:56:54 I also now have an "inside" error log, which tells if i'm inside colon words 2023-05-05 15:57:26 : oh ah ; : ah meh ; : meh error ; 2023-05-05 15:57:34 inside [ 'oh', 'ah', 'meh' ] 2023-05-05 15:58:23 but it's not too much 2023-05-05 15:58:38 I agree - the bug is usually down in the call tree of the interpreted word. 2023-05-05 15:58:57 You can't "wire in" tracing to that level without destroying your performance.
2023-05-05 15:59:14 yeah, that's why thinking in performance limits you a lot 2023-05-05 15:59:19 But you might be able to capture the call stack at the time of the error, and then studying that might tell you a lot. 2023-05-05 15:59:50 But a related problem is that where the bug causes a failure is often not related to what sowed the seed of that failure. 2023-05-05 15:59:54 I have yet to find the day I really need performance 2023-05-05 16:00:01 You have to develop an "understanding" of the whole failure mechanism. 2023-05-05 16:00:29 The Forth model offers a certain amount of performance, and I can't bring myself to willingly give it up. 2023-05-05 16:00:50 but for example thinking in performance, even without noticing, is why my words don't have checks 2023-05-05 16:01:00 The call stack analysis would be hugely informative, at least in terms of telling you the location of failure. 2023-05-05 16:01:22 it's why I assume a word that expects a type of value will receive that type of value 2023-05-05 16:01:32 Most Forths don't trap "runtime" errors, though - most of Forth's error handling stuff is around compile errors. 2023-05-05 16:01:56 I've written signal handling into mine to capture RAM address errors, divide by 0, and things like that. 2023-05-05 16:02:02 I try to say I don't want to care about performance, but I do without noticing 2023-05-05 16:02:05 bitmap(1) had a bug where a bad value got set on the config parse branch that only blew up very far away on another branch 2023-05-05 16:02:06 But only the things that naturally throw signals. 2023-05-05 16:02:32 thrig: Yes, and there's really no way for a system to aim you right at those problems. 2023-05-05 16:02:44 The system doesn't understand your code in that way. 2023-05-05 16:03:07 I suppose simplicity also comes into play 2023-05-05 16:03:27 I can't see extensive testing and simplicity going together 2023-05-05 16:03:29 When I have that kind of problem, the only thing for it is to roll up my sleeves. 2023-05-05 16:03:34 It's time consuming sometimes. 2023-05-05 16:04:06 but reflection is good in a language, and "metadata" helps debugging 2023-05-05 16:04:56 say you can maintain the number of elements every word takes from the stack 2023-05-05 16:06:50 KipIngram: you use signals as an error system? 2023-05-05 16:07:22 like they're fired events 2023-05-05 16:07:55 Yeah. Those signals already exist in the Linux underpinnings. Normally a RAM address error will throw a segfault and return you to the bash prompt. 2023-05-05 16:08:02 Unless your code catches it and handles it. 2023-05-05 16:08:35 it's cool how you turn a segfault into an error handler 2023-05-05 16:08:38 xD 2023-05-05 16:08:41 I did that - I wrote in a signal handler that routes me through my already existing error recovery system, which essentially restores the system to the state it had before I started typing the last line and sends me back to the interpreter. 2023-05-05 16:09:15 I already had a fairly typical error handler (except for the "restore" thing - usually Forth will just clear the data stack and go to the interpreter). 2023-05-05 16:09:26 But originally it only got tickled by compiler errors. 2023-05-05 16:09:44 Catching those signals wasn't easy, and there is NOT good documentation on how to do it out there. 2023-05-05 16:09:58 Because it necessarily involves processor architecture dependent details. 2023-05-05 16:10:04 It wouldn't be "portable."
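The portable half of such an error system is ANS CATCH / THROW; gforth, for instance, maps faults like invalid memory accesses and division by zero onto throw codes, so one handler can cover both software and hardware errors. A hedged sketch (risky and try are invented names):

    : risky ( n -- )  100 swap / . ;     \ prints 100/n; n=0 throws -10 (division by zero)
    : try   ( n -- )
      ['] risky catch                    \ run risky, trapping any THROW
      ?dup if cr ." caught throw code " . drop then ;

    4 try       \ prints 25
    0 try       \ prints: caught throw code -10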
2023-05-05 16:10:14 so every line you read you save the current state 2023-05-05 16:10:16 The idea would be, but details would have to be changed. 2023-05-05 16:10:27 no matter if it's from a block or stdin 2023-05-05 16:10:40 Every time the interpreter prompts me to type a line, I make a full snapshot of my system and save it. 2023-05-05 16:10:44 Horribly expensive. 2023-05-05 16:10:55 then part of the error handling is to restore from that snapshot. 2023-05-05 16:11:06 If no error occurs, the snapshot is just abandoned. 2023-05-05 16:11:16 And overwritten by the next one. 2023-05-05 16:12:00 Linux just has the hardware set up so that those errors cause traps. 2023-05-05 16:12:12 It handles the traps in a certain way - I'm just specifying a different handling. 2023-05-05 16:12:14 horribly expensive, but not much memory at all 2023-05-05 16:12:21 Not really, no. 2023-05-05 16:12:34 On a regular computer with as much RAM as we have, I figured "why not." 2023-05-05 16:12:43 You see, the first system I did this on had command line history. 2023-05-05 16:13:13 What I was aiming at was to type a line with an error, and when the error occurred I wanted to just hit up-arrow to recover that line, cursor around in it and fix the error, and hit Enter again. 2023-05-05 16:13:19 I'd like to have a proper error system 2023-05-05 16:13:27 So I wanted the system state to be what it was when that line started to be processed. 2023-05-05 16:13:39 I have to learn about it first 2023-05-05 16:13:50 I worked my way up to this - at first I had a pretty simple error handler. 2023-05-05 16:13:59 But then I started to run into situations it didn't handle well. 2023-05-05 16:14:25 Especially when lines made changes to vocabularies. Or what if the line defines a word, successfully, before the error? 2023-05-05 16:14:52 I kept finding one thing after another I needed to struggle to "unwind," and finally I just said "Screw it" and decided to save and restore EVERYTHING. 2023-05-05 16:14:57 A literal image of the entire system. 2023-05-05 16:15:38 So it restores the dictionary, the stacks, the CONTEXT and CURRENT vocabularies, and so on. All of it. 2023-05-05 16:15:47 Sledge hammer. 2023-05-05 16:15:59 in a VM it's also easy to have snapshots 2023-05-05 16:16:37 a bytecode interpreter ends up providing a lot of benefits 2023-05-05 16:16:48 and it's just a switch case with another name 2023-05-05 16:17:04 Sure. And doesn't HAVE to be inefficient, if you're careful with your code. 2023-05-05 16:17:31 I think the next Forth I write may wind up being a "token threaded" system more than anything else. Except I'm thinking of 16-bit tokens instead of byte tokens. 2023-05-05 16:17:36 I suppose a vm is my next logical step 2023-05-05 16:18:02 crc has shown me several times how much portability adds 2023-05-05 16:18:03 Well, Forth systems actually ARE vms. In a very literal way, it's a vm and the primitives are the instructions. 2023-05-05 16:19:06 You breach that barrier a bit if you have an assembler. But as long as you're writing "just Forth," you're working no lower than with those primitive instructions. 2023-05-05 16:19:46 Forth just offers you the "elegance" of being able to define new words that "work" exactly like the primitives do. 2023-05-05 16:20:01 what will you store in those 16 bits 2023-05-05 16:20:04 You don't need a special format for invoking your added functions, the way you do in most other languages. 2023-05-05 16:20:26 An index into a table of code and parameter pointers.
2023-05-05 16:20:44 The same pointers that Forth would normally use, only collected into a pair of tables. 2023-05-05 16:21:02 Those tables are why I say it's more of a token threaded system. 2023-05-05 16:21:21 In my current and past systems, those pointers have been spread out in the headers of words. 2023-05-05 16:22:09 I haven't started this yet - I may change my mind. 2023-05-05 16:22:23 It just currently appeals to me, because having 16-bit xt's makes for such a compact system. 2023-05-05 16:23:13 For fastest performance, though, those tables will need to have 64-bit entries, and that makes me wince. 2023-05-05 16:23:33 That's 16 bytes per word, right there, before I've actually provided ANY actual implementation of the word. 2023-05-05 16:23:57 I could make those table entries 32 bits instead, but then I'd have to run a little slower; the mechanics of the vm would be made more complex. 2023-05-05 16:24:21 But I could just have one table then instead of two, with each "entry" being a CFA / PFA pair. 2023-05-05 16:24:28 Lot of small decisions to make around it. 2023-05-05 16:24:57 And these tables are two more things that have to be somewhere in RAM and have to have room to grow, so it fragments up my memory model more than other approaches. 2023-05-05 16:26:00 So we're talking about at least three regions, and maybe four - depending on one table vs. two. 2023-05-05 16:26:22 An implementation region, where code and definition lists go, the one or two table regions, and the header region. 2023-05-05 16:28:30 Another annoying aspect of it involves wasted space on primitives. Primitives don't need a parameter pointer. For built-in primitives I can avoid any waste, but if I add a primitive later, with an assembler, it will have to have a slot in both tables, in spite of not needing a slot in the parameter table. 2023-05-05 16:28:53 Otherwise the table locations for a given word would get out of sync. 2023-05-05 16:29:09 A word's index has to point to the right thing in both tables. 2023-05-05 16:31:10 But if I accept that and also accept the eight-byte table entries, then the "inner interpreter" (next) is only two instructions, and next, docol, and the multi-thread switching machinery all together are only nine. 2023-05-05 16:31:13 It's beautiful. 2023-05-05 16:32:11 Oh, it also calls for registers dedicated to pointing to each of the two tables. 2023-05-05 16:33:27 But in my old systems I use a register to point to the header region, and in this new system I'm talking about that could be demoted to a variable. 2023-05-05 16:33:43 Since anything having to do with headers isn't relevant to run-time performance. 2023-05-05 16:36:09 I'm trying to convince myself about making a switch case 2023-05-05 16:36:23 https://jilp.org/vol5/v5paper12.pdf 2023-05-05 16:36:34 What's "easy" in Forth is an indexed jump table. 2023-05-05 16:36:37 The Structure and Performance of Efficient Interpreters 2023-05-05 16:36:55 I.e., a "switch" with integer cases, 0, 1, 2, 3, ... 2023-05-05 16:36:56 yeah, a hash with functions as values also works 2023-05-05 16:37:07 Having arbitrary case trigger values is a lot more tedious. 2023-05-05 16:37:10 but a switch is usually faster 2023-05-05 16:37:43 a hash with functions is better than a switch 2023-05-05 16:37:45 I think if I were going to do that, I'd supply a mechanism to map arbitrary case identifiers onto integers. 2023-05-05 16:37:53 And call that first and then use a jump table. 2023-05-05 16:38:51 It would still be harder. 
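The "indexed jump table" being described, sketched in plain Forth: an array of execution tokens dispatched on a small integer. The opcode assignments here are arbitrary, just for the example.

    create ops  ' + ,  ' - ,  ' * ,  ' / ,     \ opcodes 0..3
    : dispatch ( n1 n2 opcode -- n3 )  cells ops + @ execute ;

    6 7 2 dispatch .    \ opcode 2 is * , prints 42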
2023-05-05 16:47:32 I'm curious about using a forth-like to write assembly code 2023-05-05 16:47:54 you can add layer upon layer until you have something quite advanced 2023-05-05 16:48:31 but it's strange to see you don't usually do that 2023-05-05 18:14:51 Well, that's exactly how Forth is intended to be used, though. 2023-05-05 19:52:20 Lot of Forth I see in other places just looks like C. At least it's laid out on the page like C. 2023-05-05 20:22:46 maybe that's C coders trying to make it look more Cish 2023-05-05 21:30:01 Yeah, my presumption too. 2023-05-05 21:30:51 I fully admit to making C-like Forth - I've learned to love the local variable 2023-05-05 21:31:52 (seriously, just try to write code that blits from one rectangle on one bitmap to another rectangle on another bitmap without them) 2023-05-05 21:32:09 (which is precisely why I gave in and added them to my Forth) 2023-05-05 21:32:58 an RNG was a bit tricky without local variables, there were like five little fiddly state variables 2023-05-05 21:35:02 the thing is that my local variable-using code is simpler (and probably much faster) than what its traditional counterpart would be 2023-05-05 21:35:07 far less ROT, no ROLL 2023-05-05 21:35:13 ROT and -ROT are expensive 2023-05-05 21:35:20 I've run into Forth cases where I needed more than the top two items on the stack. We talked about that at length here one day. It's what prompted me to add stack frames to my Forth. They're basically local variables - just unnamed. 2023-05-05 21:35:20 setting a local variable is not 2023-05-05 21:35:57 Yeah, there's nothing "expensive" about indexing into the stack. It's just not well thought of. 2023-05-05 21:36:17 my local variables live on the return stack, and I've made DO LOOP's use local variables behind the scenes so they're compatible with local variables 2023-05-05 21:36:22 Especially if you set up a mechanism to make it efficient. 2023-05-05 21:36:24 also, my local variables are block-scoped 2023-05-05 21:36:32 KipIngram, PICK is very cheap 2023-05-05 21:36:41 ROT, -ROT, and especially ROLL are not 2023-05-05 21:36:44 Yeah. 2023-05-05 21:37:13 I work on the data stack, because it's not so much "locals" I want as access to parameters and intermediate results. 2023-05-05 21:37:24 I don't use the mechanism very much. 2023-05-05 21:37:24 my code prior to my learning to love the local variable was chock full of ROT and -ROT 2023-05-05 21:37:32 I look for ways to avoid it. 2023-05-05 21:38:19 to me local variables make it easier to write more reliable code faster because I then don't have to constantly keep a mental model of the state of the stack 2023-05-05 21:38:33 or otherwise write detailed stack annotations on each line, and hope I got them right 2023-05-05 21:39:44 when I go back to my pre-local variable code now I often don't understand a thing 2023-05-05 21:44:55 I understand and actually support the general "no locals" philosophy. In most cases, if you need that much data around you haven't factored enough. Now and then, though, I bumped into cases I couldn't figure out how to handle by further factoring. 2023-05-05 21:45:08 So I pull that tool out just for those cases. 2023-05-05 21:46:07 to me the issue is when you simply cannot remove state 2023-05-05 21:46:23 Yeah - some things just have complex state. That's exactly it. 2023-05-05 21:46:29 e.g.
when you have to work with two strings, both with a pointer and a count 2023-05-05 21:46:49 And when you need to get at a lot of it, and not just pieces of it at a time, it's hard with normal Forth words. 2023-05-05 21:46:57 saying "oh you have to limit yourself to two things on the top of the stack" doesn't cut it 2023-05-05 21:47:13 Well, I find that it's often possible to do that though. 2023-05-05 21:47:22 And it usually does strike me as cleaner code. 2023-05-05 21:47:31 But not "always." 2023-05-05 21:47:42 to me stack churn is a code smell 2023-05-05 21:48:03 and a lot of the time stack churn is unavoidable unless you use local variables 2023-05-05 21:48:49 pre-local variables I'd use >R R@ R> a lot to simplify my data stack usage, but local variables do the same thing while being far more readable and far less error-prone 2023-05-05 21:49:10 Sometimes. You can avoid it in many cases. 2023-05-05 21:49:35 You've got to think ahead so things are arranged the right way on the stack. 2023-05-05 21:50:38 I don't think there's anything wrong with having a way to reach into the stack. It's nice when you don't have to, but as we noted earlier, it's not inefficient. 2023-05-05 21:51:03 my usual pattern would be to identify the single most important thing in a word (what is called "this" or "self" in OO languages), and I'd stick it on the return stack 2023-05-05 21:52:37 the problem with PICK is the mental gymnastics of having to identify how deep to reach into the stack each time you use it 2023-05-05 21:52:45 it's error-prone 2023-05-05 21:53:06 Yes it is. That's what my stack frames avoid. 2023-05-05 21:53:15 and to make it work you have to stick an ( x y z foo bar baz etc etc etc ) at the end of each line of your code to keep track of it all 2023-05-05 21:53:17 I do a stack frame with { ... } 2023-05-05 21:53:30 The { sets a "frame pointer," and then inside the frame I index off of it. 2023-05-05 21:53:39 The top of the stack can move around, but the frame pointer stays put. 2023-05-05 21:53:45 So the offsets of things remain unchanged. 2023-05-05 21:54:08 I use { ... } for defining local variables 2023-05-05 21:54:13 { sets the frame pointer to the stack pointer, and } sets the stack pointer to the frame pointer. 2023-05-05 21:54:20 The existing frame pointer is saved on the return stack. 2023-05-05 21:54:54 note, however, that I can define local variables multiple times in a word, and I can access them from within DO LOOP's 2023-05-05 21:55:05 traditional, standard local variables can't do that 2023-05-05 21:55:33 That restoring of the stack pointer at the end is a nice fringe benefit. I make control structures using conditional single and double returns, so I can "return back up" into a stack frame from multiple places sometimes, and those routes might leave different amounts of stuff on the stack. 2023-05-05 21:55:40 in standard Forth you can only define local variables once in a word, outside of any blocks, and you can't access them within DO LOOP's 2023-05-05 21:55:43 I don't have to worry about that because } will clean things up. 2023-05-05 21:56:39 ACTION has been following gforth as a model for his local variables 2023-05-05 21:56:55 I actually give } a parameter - it's the number of "additional" items to drop from the stack. Nice way to get rid of parameters no longer needed. 2023-05-05 21:57:57 If I wanted "locals," I'd probably just move the stack pointer further along, allocating space below the frame pointer, and index off of it in the negative direction.
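A rough C model of the { ... } frame mechanism as described (my own sketch; the names open_frame, close_frame, frame_fetch and the indexing convention are assumptions): { saves the old frame pointer on the return stack and captures the current stack depth, the numbered access words are a single indexed load or store off the frame pointer, and } restores the stack pointer, optionally dropping extra cells below the frame, before popping the previous frame pointer.

    #include <stdio.h>
    #include <stdint.h>

    typedef intptr_t cell;

    static cell dstack[64]; static int sp = 0;   /* data stack              */
    static int  rstack[64]; static int rp = 0;   /* return stack (frames)   */
    static int  fp = 0;                          /* current frame pointer   */

    /* {  : save the old frame pointer, mark the current stack depth        */
    static void open_frame(void)           { rstack[rp++] = fp; fp = sp; }

    /* n@ : fetch the nth frame cell (0 = what was on top at the `{`)       */
    static cell frame_fetch(int n)         { return dstack[fp - 1 - n]; }

    /* n! : store into the nth frame cell                                   */
    static void frame_store(int n, cell v) { dstack[fp - 1 - n] = v; }

    /* n } : restore the stack pointer, dropping n additional cells below
       the frame, then restore the previous frame pointer                   */
    static void close_frame(int extra)     { sp = fp - extra; fp = rstack[--rp]; }

    int main(void)
    {
        dstack[sp++] = 10; dstack[sp++] = 20; dstack[sp++] = 30;  /* three args */
        open_frame();                              /* {                         */
        dstack[sp++] = 99;                         /* scratch above the frame   */
        frame_store(1, frame_fetch(1) + 1);        /* increment cell 1: 20 -> 21 */
        printf("%ld %ld %ld\n", (long)frame_fetch(0),
               (long)frame_fetch(1), (long)frame_fetch(2));   /* 30 21 10       */
        close_frame(3);                            /* 3 } : drop the three args */
        printf("depth after }: %d\n", sp);         /* 0                         */
        return 0;
    }

Because the access words index off a fixed base, they cost about the same as PICK, but the offsets stay put no matter how much the top of the stack moves around inside the frame.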
2023-05-05 21:58:36 if I were you I'd make } take its number of extra items to drop at compile time 2023-05-05 21:59:01 the reason being that then you can just compile it into the code as a simple addition, in many cases as a single instruction 2023-05-05 21:59:52 a big advantage of how I've implemented local variables is that dropping them is simply ADD SP, SP, # 2023-05-05 22:00:02 and this is all automated for you 2023-05-05 22:00:33 Is your Forth code-threaded? 2023-05-05 22:00:47 my Forth is an inlining native code Forth 2023-05-05 22:01:08 Ok, sure then. My Forth is indirect threaded, and } is a primitive - that's as "compiled" as it gets in my system. 2023-05-05 22:01:30 I suppose I could create 0} and 1} and 2} and so on if I wanted to, but I haven't. 2023-05-05 22:01:54 I write fairly primitive-heavy Forths already. 2023-05-05 22:02:20 I personally criticize the design of standard Forth +LOOP because it takes its argument at runtime, which makes it a pain to optimize 2023-05-05 22:02:27 I do just that kind of thing on the stack *access* words. I have 1@ and 1! and 1+! and 2@ and ... 2023-05-05 22:02:36 You get the idea. On up to 5 or 6. 2023-05-05 22:02:48 yeah 2023-05-05 22:03:02 car cdr caar cadr cdar cddr caaar 2023-05-05 22:03:07 yeah 2023-05-05 22:03:51 ... cddaar cddadr cdddar cddddr 2023-05-05 22:03:55 actually 2023-05-05 22:04:19 if you're really smart about it, you can optimize much of that stuff even without explicitly specifying it at compile time 2023-05-05 22:04:27 My reasoning was that I use the stack access words a fair bit, whereas { and } themselves are used less often. 2023-05-05 22:04:43 what I do is I "defer" literals and constants 2023-05-05 22:04:52 I don't immediately compile them into instructions 2023-05-05 22:05:01 the other nice thing about my mechanism is that I can call from inside { ... } and no matter how deep down I go I can still access the frame with the same indices. 2023-05-05 22:05:10 Unless I open another frame down there. 2023-05-05 22:05:16 and then if I have a word like +, it detects whether there's a deferred literal or constant 2023-05-05 22:05:28 and if there is, it folds it into the compiled instruction 2023-05-05 22:05:44 for instance 1 + compiles to a single instruction, ADDS R6, #1 2023-05-05 22:05:46 Yeah, with a system with a lot of inlining there are all kinds of fun games you can play. 2023-05-05 22:06:45 I just like indirect threading, and I always figure that if I need peak performance I can custom assemble those bits. It's usually only a very small part of your code that makes any big difference. 2023-05-05 22:07:12 And I want a good profiling mechanism. I haven't written one yet, but I know how I want to write it. 2023-05-05 22:07:25 you could do something like this - make words like s! s@ s+! and have a deferred literal/constant mechanism 2023-05-05 22:07:31 Especially with this table-based scheme I'm considering, it will be quite easy. 2023-05-05 22:07:47 and when you compile s! s@ or s+! have it check for a deferred literal/constant 2023-05-05 22:08:17 and if it is within a certain range, have it generate a hardcoded word, so that 2 s+! could compile to 2+! 2023-05-05 22:08:18 My interests at the moment are just pointing in other directions. 2023-05-05 22:08:32 This threading and piping stuff is on my mind.
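A toy C sketch of the "deferred literal" folding described here (illustrative only; the emitted text mimics the ADDS R6, #1 example above, with R6 assumed to be the top-of-stack register): compiling a literal just records it, and when + compiles it checks for a pending literal and folds it into a single immediate-operand instruction, otherwise it emits the general sequence.

    #include <stdio.h>
    #include <stdbool.h>

    /* Toy model of deferring literals and folding them into the next
       operation at compile time. The "instructions" are just printed
       text, not real code generation.                                 */

    static long pending_lit;           /* value of the deferred literal    */
    static bool have_pending = false;  /* is a literal waiting to be used? */

    /* Compiling a literal records it instead of emitting code right away. */
    static void compile_literal(long n)
    {
        if (have_pending)                      /* flush an older pending one */
            printf("  PUSH  #%ld\n", pending_lit);
        pending_lit  = n;
        have_pending = true;
    }

    /* Compiling + folds a pending literal into an immediate add;
       otherwise it emits the general two-operand sequence.        */
    static void compile_plus(void)
    {
        if (have_pending) {
            printf("  ADDS  R6, #%ld\n", pending_lit);   /* e.g. "1 +" */
            have_pending = false;
        } else {
            printf("  POP   R0\n");
            printf("  ADDS  R6, R0\n");
        }
    }

    int main(void)
    {
        compile_literal(1);   /* "1 +" folds into a single ADDS R6, #1 */
        compile_plus();
        compile_plus();       /* a bare "+" gets the general sequence  */
        return 0;
    }

The same check-for-a-pending-literal pattern is what the suggested s! / s@ / s+! words would do: if the deferred value is in range, pick a hardcoded word like 2+! instead of compiling the general form.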
2023-05-05 22:10:51 I'm into native code compiling because I want to squeeze out the performance of the systems I target, and the systems I target are large enough that I don't have to worry that much about code size (they're certainly not PC's or even the SoC RPi's though) 2023-05-05 22:12:36 still, my compiler doesn't rival Matthias Koch's Mecrisp-Stellaris - Matthias is a veritable compiler wizard 2023-05-05 22:13:14 I'm a big fan of performance. I just find indirect threading... "endearing." :-) Like I said, I want to profile and optimize where it matters. 2023-05-05 22:16:12 I'm in the habit of writing a lot of primitives in Forth that generate native code directly 2023-05-05 22:17:03 e.g. my local variable code is in Forth, but it generates native instructions