WaveTree - Realtime bigdata tool for bit strings up to 2^63 based on AVL forest

*

Ben.F.Rayfield

  • Bumblebee
  • **
  • 38
Immutable Sparse Wave Trees (WaveTree)

https://sourceforge.net/projects/wavetree version 0.2.0 is an 84 kB jar file containing its own source code.

Open source, GNU LGPL 2+

Realtime bigdata tool at the bit level, based on an immutable AVL forest which can run in memory or, in future versions, as a Merkle forest like a blockchain. The main object is a sparse bit string (Bits) that scales efficiently up to 2^63 bits and is normally compressed, because the forest shares duplicated substrings. Bits objects support reading a bit, byte, short, int, or long (Java primitives) at any bit index in the 64-bit range. Example: instead of building a class to hold a header and then data, represent all of that as Bits, subranges of them, and ints for the sizes of the parts. Other kinds of compression can be added, since Bits is a Java interface. The main functions on Bits are substring, concat, number of 0 or 1 bits, and number of bits (size). All of those operations can be done millions of times per second regardless of size, because the AVL forest reuses existing branches recursively. There's a scalar Java package (originally for copy/pasting subranges of sounds) and a bit Java package. There is also a sparse n-dimensional matrix.
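To make the branch reuse concrete, here is a minimal sketch of the idea, not WaveTree's actual code. The names `BitsSketch`, `Node`, `concat`, `doubled` and `bitAt` are made up for illustration, and the AVL rotations are left out because concatenating a tree with itself is already balanced. Each node caches its size and its count of 1 bits, so both are available without walking the tree.

```java
/** Illustrative sketch only; these names are hypothetical, not the WaveTree API. */
public class BitsSketch {

    /** Immutable bit string: a leaf holds one bit, a branch is the concat of two children. */
    static final class Node {
        final Node left, right; // both null for a leaf
        final long size;        // number of bits in this subtree
        final long ones;        // number of 1 bits in this subtree
        final int height;       // 0 for a leaf

        Node(boolean bit) {
            left = null;
            right = null;
            size = 1;
            ones = bit ? 1 : 0;
            height = 0;
        }

        Node(Node l, Node r) {
            left = l;
            right = r;
            size = Math.addExact(l.size, r.size); // throws instead of overflowing past 2^63 - 1
            ones = l.ones + r.ones;
            height = 1 + Math.max(l.height, r.height);
        }

        /** Walks one path from the root to a leaf, so the cost is the tree height. */
        boolean bitAt(long i) {
            if (i < 0 || i >= size) throw new IndexOutOfBoundsException("bit index " + i);
            Node n = this;
            while (n.left != null) {
                if (i < n.left.size) {
                    n = n.left;
                } else {
                    i -= n.left.size;
                    n = n.right;
                }
            }
            return n.ones == 1;
        }
    }

    /** Concat without rebalancing (assumption: a real AVL forest would rotate here). */
    static Node concat(Node a, Node b) {
        return new Node(a, b);
    }

    /** Concats a string with itself `times` times; every level reuses the same child twice. */
    static Node doubled(Node start, int times) {
        Node n = start;
        for (int k = 0; k < times; k++) n = concat(n, n);
        return n;
    }

    public static void main(String[] args) {
        Node pattern = concat(new Node(true), new Node(false)); // the 2-bit string "10"
        Node big = doubled(pattern, 61);                        // 2^62 bits from 64 node objects
        System.out.println(big.size == (1L << 62));   // true
        System.out.println(big.ones == (1L << 61));   // true
        System.out.println(big.height);               // 62
        System.out.println(big.bitAt((1L << 62) - 2)); // true (even index)
    }
}
```

The 2^62-bit string above exists as only 64 objects, because each level points at the same child twice. That sharing is what the description means by the forest compressing duplicated substrings.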

AVL tree balancing avoids a deep, slow forest.

Bits substring, concat, and counting 1 bits in any subrange or combination cost only logarithmic time and memory (millions of times per second on an average computer).

Versioning on the N-dimensional matrix object (Multidim), since it's only a view of a Bits object. I've tested this on 10000 images from the MNIST OCR data.

Scalar and bit versions: originally it was scalar, for copy/pasting subranges of sound. The same operations work for bit strings.

Can store sounds that are years long, since it's sparse. The same works for bit strings up to 2^63.
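The "only a view" point about Multidim can be sketched as follows. This is an assumption-laden illustration, not the real Multidim class: `MultidimSketch`, `View`, `offset` and `get` are hypothetical names, a `LongPredicate` stands in for a Bits object, and row-major layout with 1 bit per pixel is assumed. The view stores no data, so a new version of the matrix is just the same dimensions over a different immutable bit string.

```java
import java.util.function.LongPredicate;

/** Illustrative sketch only; these names are hypothetical, not the WaveTree API. */
public class MultidimSketch {

    /** Read-only n-dimensional view over a flat bit source, row-major (assumed layout). */
    static final class View {
        final LongPredicate bits; // stand-in for a Bits object: bit index -> bit value
        final long[] dims;
        final long[] strides;

        View(LongPredicate bits, long... dims) {
            this.bits = bits;
            this.dims = dims.clone();
            this.strides = new long[dims.length];
            long stride = 1;
            // The last dimension varies fastest.
            for (int d = dims.length - 1; d >= 0; d--) {
                strides[d] = stride;
                stride = Math.multiplyExact(stride, dims[d]);
            }
        }

        /** Maps an n-dimensional index to a single bit offset. */
        long offset(long... index) {
            if (index.length != dims.length) {
                throw new IllegalArgumentException("expected " + dims.length + " indices");
            }
            long off = 0;
            for (int d = 0; d < dims.length; d++) {
                if (index[d] < 0 || index[d] >= dims[d]) {
                    throw new IndexOutOfBoundsException("index " + index[d] + " in dimension " + d);
                }
                off += index[d] * strides[d];
            }
            return off;
        }

        boolean get(long... index) {
            return bits.test(offset(index));
        }
    }

    public static void main(String[] args) {
        // Two versions of the same 10000 x 28 x 28 volume (MNIST images are 28 x 28).
        LongPredicate version1 = i -> i % 3 == 0;
        LongPredicate version2 = i -> i % 3 == 0 || i == 785; // one bit changed
        View a = new View(version1, 10000, 28, 28);
        View b = new View(version2, 10000, 28, 28);
        System.out.println(a.offset(1, 0, 1)); // 785
        System.out.println(a.get(1, 0, 1));    // false
        System.out.println(b.get(1, 0, 1));    // true
    }
}
```

Both views stay valid side by side, so keeping old versions costs nothing beyond whatever the underlying bit strings do not share.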

-----

I too often get lost in excessive abstraction, but I'm getting back to keeping it real. This is one of my tools that I only came to understand the need for after years of research. It will be at the core of my game, AI, and science network, along with my new kind of mindmap and statistical tools like Boltzmann machines and Bayesian networks. All those things will be represented using this foundation of bits.

*

Art

  • At the end of the game, the King and Pawn go into the same box.
  • Trusty Member
  • **********************
  • Colossus
  • *
  • 5865
Nice to see you back after your last visit. Hope you've been well and weathering our wonder weather.
I'm not far from NG near Springfield....

Any sample images of your forest?
In the world of AI, it's the thought that counts!

*

Ben.F.Rayfield

You can see WaveTree used in http://sf.net/projects/physicsmata version 2.0.0, where it's used for a 3D array of bits for some of the MNIST OCR dataset.

*

ranch vermin

  • Not much time left.
  • Terminator
  • *********
  • 947
  • Its nearly time!
63 bits is not very much. My video tracker uses 1024 bits already, and SIFT uses 1024 BYTES!!!

*

Ben.F.Rayfield

You're talking about an encryption system, like the one Netflix uses to prevent you from directly copying the video bits. No video file is larger than 2^63 bits, or even close to it. You could fit a whole human brain in 2^63 bits. If it were to hold the whole internet I'd go with 2^128, just to leave lots of room for expansion over many years, but 1024 is way overkill, and you'll pay for it because hardware is only optimized for 64 bits at a time.
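To put the numbers in perspective, here is a small arithmetic sketch (the class and method names are made up for illustration). 2^63 is the maximum length of one bit string, not a word width, and it works out to 2^60 bytes, which is one exbibyte.

```java
import java.math.BigInteger;

/** Illustrative arithmetic only; class and method names are hypothetical. */
public class CapacitySketch {

    /** Maximum string length in bits: 2^63. */
    static BigInteger maxBits() {
        return BigInteger.ONE.shiftLeft(63);
    }

    /** The same capacity in bytes: 2^63 / 8 = 2^60. */
    static BigInteger maxBytes() {
        return maxBits().shiftRight(3);
    }

    public static void main(String[] args) {
        System.out.println(maxBits());  // 9223372036854775808
        System.out.println(maxBytes()); // 1152921504606846976
        // A Java long index reaches 2^63 - 1, so every bit position fits in one 64-bit word.
        System.out.println(BigInteger.valueOf(Long.MAX_VALUE).add(BigInteger.ONE).equals(maxBits())); // true
    }
}
```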

*

Korrelan

  • Trusty Member
  • ***********
  • Eve
  • *
  • 1454
  • Look into my eyes! WOAH!