From e69e265f78025b80f4dc33409aa851a351ebe3bb Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Haoran=20S=2E=20Diao=20=28=E5=88=81=E6=B5=A9=E7=84=B6=29?= <0@hairydiode.xyz> Date: Fri, 30 Jun 2023 09:15:30 -0700 Subject: Jankime update, unihome prelim --- cont/jankime.html | 9 ++- cont/unihome.html | 150 ++++++++++++++++++++++++++++++++++++++++++++++ jankime.html | 9 ++- unihome.html | 175 ++++++++++++++++++++++++++++++++++++++++++++++++++++++ 4 files changed, 339 insertions(+), 4 deletions(-) create mode 100644 cont/unihome.html create mode 100644 unihome.html diff --git a/cont/jankime.html b/cont/jankime.html index 4d9623f..64a6d0d 100644 --- a/cont/jankime.html +++ b/cont/jankime.html @@ -5,6 +5,11 @@ [SETTITLE]Janky IME [SETDATE]6-29-2023 UPDATE: This IME is now tmux based, old xdotool version is still here + +UPDATE2: I have created the most cursed thing in existance. Full unicode display +and input support in the linux console using only bash/busybox and tmux. +See we have unicode at home + A new python version came out, so of course that means every python package on my rolling-release system has broken. This includes ibus, which I need for my input method. I'm currently running some web-crawling scripts that I don't want @@ -76,10 +81,10 @@ The Implementation: from line seperated to space seperated was done for free. However this makes the code less portable - CODE: + CODE: char=$(echo $opt | awk "{print \$1}") ... - xdotool type --window "$1" "$char" + tmux send-key -t "!" "$char" Downsides/The Future: This works in the linux console but obviously the linux console has diff --git a/cont/unihome.html b/cont/unihome.html new file mode 100644 index 0000000..953f0dd --- /dev/null +++ b/cont/unihome.html @@ -0,0 +1,150 @@ +[TITLE] [DATE] +-------------------------------------------------------------------------------- +[SETTITLE]We Have Unicode at Home +[SETDATE]6-30-2023 +Preface + it's just uses more memory, handwriting in the 70s, arabic/farsi + terminals, historically never existed an ascii only time. telegram codes +busybox + bash + sed + awk + grep + bc + iconv + xxd + read + sort + uniq + cat +tmux +kbd + console-braille +zpix bdf + 30M +zpix ttf + 4.5MiB +jizji + 1.3M +misaki + 747K +Google + 2.7 MB +LinBiolinumTI.pfb + 860KiB +HanaMinA + 22M,30M +unifont + 11.7MiB +Latex 2.9GiB +cm-super + 57.8MiB + just european languages + cyrillic +cbfonts + 70.6MiB +ensembl human genome + 4.5GiB +Rant + Aesthetics vs. Function + cool-retro-term, pixel fonts, monospace of chinese vs english + + + + The text confusion + In the beginning there was not the command line. There was wall + paintings bone etc + + Inefficiency + The only first class data types on a computer are int, + uint, and float. Why is there not universal way to + display/store them on posix systems, 256 combos per byte, + only 9 used, less than 5% efficiency + + HTML v. inefficient, easy to grep kinda + Json, v. inefficient + + Data confusion + IME + table takes in keypresses, spits out unicode character + keypresses should be own type, but is ascii, + what happens when different keyboard layout? + + What happens if typing russiand and want to use + vim or press C-c? + + Big table, very simple datatype, not first class + + Tree/files, super simple datatype, not first + class, file argument woes + + Display: + simply doing an OR required like 3 processes + because every program required different text + representation of the same data, even though + first class data type + + no language has first class lexer, closest is + awk + + bdf file ridiculously inefficient, keywords too + long, actual data is 2x by hexadec + representation + + bdf file is just a big table w/ 2d array as + output , very simple data type, have to do 1000 + conversions for input (decimal codepoint vs + 32bit vs utf-8), and output (2d array of bits vs + hex representation of the same) + + Big table no way to sort to make more efficient + + Representation + Forced to represent all out data so that the lowest + common denominator teletype in 1970s new jersey can + print it if we were to send it directly over serial + not just a bash issue: JSON, HTML, PDB, even + PDF/postscript + Ascii isn't event text, can't write accents or + directiona quotes or nn or even a bar over a letter. + Flipside, nobody who doesn't use posix knows or cares + what ~ and | are. + + Regex, same basic thing, 30 different variants, because + forced to represent as text with no specialized symbols + + same with code, every language has its own way of + representing a code block, none of which are + particularly legible + + if should be one key press and one byte + + In-band vs out of band + no universal way to embed data, json has directional + brackets, backslash hell is the norm, completely + avoidable, but the text obsession means type info is + ignored + guis + all based off of one dumb xerox experiment + all have same issues + lossy data display + no interop of actual data + no open loop input + no way to store input as its own data/scripting + + in memory data: + no interop, spend all your time using framework + libraries to convert data around. It's not just + a bash issue + + weird selection of first class data types, why + is text 1st class and not a mesh or a linked + list? +Rant + In the beginning, there was not a command line. In the beginning, there + was iron oxide pigment on torch lit cave walls, then there were stylus + indentations on clay, patterns carved on turtle shell, knots + tied in string, grooves cut in vinyl, and finally discrete states stored + in a great multitude of mechanisms. The universal datatype is not text, + it is uint_256, IEEE floating points. diff --git a/jankime.html b/jankime.html index 2abddec..3344c4a 100644 --- a/jankime.html +++ b/jankime.html @@ -21,6 +21,11 @@ 一二三四-->Janky IME 6-29-2023 -------------------------------------------------------------------------------- UPDATE: This IME is now tmux based, old xdotool version is still here + +UPDATE2: I have created the most cursed thing in existance. Full unicode display +and input support in the linux console using only bash/busybox and tmux. +See we have unicode at home + A new python version came out, so of course that means every python package on my rolling-release system has broken. This includes ibus, which I need for my input method. I'm currently running some web-crawling scripts that I don't want @@ -92,10 +97,10 @@ The Implementation: from line seperated to space seperated was done for free. However this makes the code less portable - CODE: + CODE: char=$(echo $opt | awk "{print \$1}") ... - xdotool type --window "$1" "$char" + tmux send-key -t "!" "$char" Downsides/The Future: This works in the linux console but obviously the linux console has diff --git a/unihome.html b/unihome.html new file mode 100644 index 0000000..c9744b3 --- /dev/null +++ b/unihome.html @@ -0,0 +1,175 @@ + + +We Have Unicode at Home + + + + + +
+
+--------------------------------------------------------------------------------
+
+>HairyDiode                                             
+
+--------------------------------------------------------------------------------
+We Have Unicode at Home                                                      6-30-2023
+--------------------------------------------------------------------------------
+Preface
+	it's just uses more memory, handwriting in the 70s, arabic/farsi
+	terminals, historically never existed an ascii only time. telegram codes
+busybox
+	bash
+	sed
+	awk
+	grep
+	bc
+	iconv
+	xxd
+	read
+	sort
+	uniq
+	cat
+tmux
+kbd
+	console-braille
+zpix bdf
+	30M
+zpix ttf
+	4.5MiB
+jizji
+	1.3M
+misaki
+	747K
+Google
+	2.7 MB
+LinBiolinumTI.pfb
+	860KiB
+HanaMinA
+	22M,30M
+unifont
+	11.7MiB
+Latex 2.9GiB
+cm-super
+	57.8MiB
+		just european languages + cyrillic
+cbfonts
+	70.6MiB
+ensembl human genome
+	4.5GiB
+Rant
+	Aesthetics vs. Function
+	cool-retro-term, pixel fonts, monospace of chinese vs english
+	
+	
+
+	The text confusion
+		In the beginning there was not the command line. There was wall
+		paintings bone etc
+
+		Inefficiency
+			The only first class data types on a computer are int,
+			uint, and float. Why is there not universal way to
+			display/store them on posix systems, 256 combos per byte,
+			only 9 used, less than 5% efficiency
+
+			HTML v. inefficient, easy to grep kinda 
+			Json, v. inefficient
+
+		Data confusion
+			IME
+			table takes in keypresses, spits out unicode character
+				keypresses should be own type, but is ascii,
+				what happens when different keyboard layout?
+
+				What happens if typing russiand and want to use
+				vim or press C-c?
+
+				Big table, very simple datatype, not first class
+
+				Tree/files, super simple datatype, not first
+				class, file argument woes
+
+			Display:
+				simply doing an OR required like 3 processes
+				because every program required different text
+				representation of the same data, even though
+				first class data type
+				
+				no language has first class lexer, closest is
+				awk
+
+				bdf file ridiculously inefficient, keywords too
+				long, actual data is 2x by hexadec
+				representation
+
+				bdf file is just a big table w/ 2d array as
+				output , very simple data type, have to do 1000
+				conversions for input (decimal codepoint vs
+				32bit vs utf-8), and output (2d array of bits vs
+				hex representation of the same)
+
+			Big table no way to sort to make more efficient
+
+		Representation
+			Forced to represent all out data so that the lowest
+			common denominator teletype in 1970s new jersey can
+			print it if we were to send it directly over serial
+				not just a bash issue: JSON, HTML, PDB, even
+				PDF/postscript
+			Ascii isn't event text, can't write accents or
+			directiona quotes or nn or even a bar over a letter.
+			Flipside, nobody who doesn't use posix knows or cares
+			what ~ and | are.
+
+			Regex, same basic thing, 30 different variants, because
+			forced to represent as text with no specialized symbols
+
+			same with code, every language has its own way of
+			representing a code block, none of which are
+			particularly legible
+
+			if should be one key press and one byte
+
+		In-band vs out of band
+			no universal way to embed data, json has directional
+			brackets, backslash hell is the norm, completely
+			avoidable, but the text obsession means type info is
+			ignored
+		guis
+			all based off of one dumb xerox experiment
+			all have same issues
+				lossy data display
+				no interop of actual data
+				no open loop input
+				no way to store input as its own data/scripting
+		
+			in memory data:
+				no interop, spend all your time using framework
+				libraries to convert data around. It's not just
+				a bash issue
+
+				weird selection of first class data types, why
+				is text 1st class and not a mesh or a linked
+				list?
+Rant
+	In the beginning, there was not a command line. In the beginning, there
+	was iron oxide pigment on torch lit cave walls, then there were stylus
+	indentations on clay, patterns carved on turtle shell, knots
+	tied in string, grooves cut in vinyl, and finally discrete states stored
+	in a great multitude of mechanisms. The universal datatype is not text,
+	it is uint_256, IEEE floating points.
+
+
+
+
+ + -- cgit v1.1