I had a look at the code again, and I got it running faster.
It always gets better the more I sit and tinker with something.
Ultron, I didn't believe I could get one of these video nets running on one GPU until very recently. The finished one will work on 4 or so at once, in SLI.
Art, that music is my favorite for doing this mechanization stuff. Futuristic drum and bass really puts me in the mood for making a robot, hehe. And trust me, it's definitely better with a 1024-bit descriptor. I can vouch for that now that I've looked at it for ages: the surface positions aren't stable without it.
How to see the backsides of things? Very simple. Just as a Markov chain word predictor predicts the next word, you think in 3D space instead, and wrap around to the other side as if it were a sentence of words. Just saying generally.
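To give a rough feel for why the longer descriptor might help, here is a toy sketch. It assumes the descriptor is a plain binary string matched by Hamming distance, which is my guess for illustration and not necessarily how the real net does it. The names (hamming, add_noise, match_rate) and the noise level are all made up.

```python
import random

def hamming(a, b):
    # Number of differing bits between two descriptors stored as ints.
    return bin(a ^ b).count("1")

def add_noise(desc, bits, flip_prob, rng):
    # Flip each bit independently with probability flip_prob (assumed noise model).
    noisy = desc
    for i in range(bits):
        if rng.random() < flip_prob:
            noisy ^= 1 << i
    return noisy

def match_rate(bits, n_points=50, flip_prob=0.3, seed=0):
    # Fraction of noisy queries whose nearest stored descriptor is the right one.
    rng = random.Random(seed)
    descs = [rng.getrandbits(bits) for _ in range(n_points)]
    hits = 0
    for idx, d in enumerate(descs):
        query = add_noise(d, bits, flip_prob, rng)
        best = min(range(n_points), key=lambda j: hamming(query, descs[j]))
        if best == idx:
            hits += 1
    return hits / n_points

if __name__ == "__main__":
    for bits in (16, 64, 256, 1024):
        print(bits, match_rate(bits))
```

With more bits, the gap between "same surface point, but noisy" and "different surface point" gets wider relative to the noise, so the nearest match flips around less from frame to frame. That is only the general statistical argument, not a measurement of the actual system.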
So if you see a foot, you have a person.
If you see a basketball, maybe there's a kid.
If it's done statistically and stochastically, it may be quite amusing watching it come up with random scenes.
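The foot/person and basketball/kid idea can be sketched as a tiny Markov chain over things instead of words. This is only a toy under my own assumptions: the observation pairs are invented, and the function names (build_transitions, predict_next, imagine_scene) are hypothetical, not from the real code. A real version would chain over 3D surface patches rather than labels.

```python
import random
from collections import defaultdict

# Invented observations: (thing seen, thing found next to it).
OBSERVED_PAIRS = [
    ("foot", "person"),
    ("foot", "person"),
    ("foot", "shoe"),
    ("basketball", "kid"),
    ("basketball", "hoop"),
    ("person", "basketball"),
    ("kid", "basketball"),
]

def build_transitions(pairs):
    # Count how often each neighbour follows each seen thing.
    counts = defaultdict(lambda: defaultdict(int))
    for seen, neighbour in pairs:
        counts[seen][neighbour] += 1
    return counts

def predict_next(counts, seen, rng):
    # Sample a neighbour in proportion to its count; None if nothing is known.
    options = counts.get(seen)
    if not options:
        return None
    names = list(options)
    weights = [options[name] for name in names]
    return rng.choices(names, weights=weights, k=1)[0]

def imagine_scene(counts, start, steps, rng):
    # Walk the chain from one observed thing to make up a random scene.
    scene = [start]
    for _ in range(steps):
        nxt = predict_next(counts, scene[-1], rng)
        if nxt is None:
            break
        scene.append(nxt)
    return scene

if __name__ == "__main__":
    table = build_transitions(OBSERVED_PAIRS)
    print(imagine_scene(table, "foot", 5, random.Random()))
```

Because the next thing is sampled rather than always taking the most likely one, each run can wander off into a different scene, which is the amusing part.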
Bringing back the animation is as if I've put all the capture into lots of backpropagators, doing everything at once, and I bring it back like the animation in GTA V. But it's all done in a "greedy fell-swoop fashion"... which is my middle name, definitely not originality.
Interestingly, I don't think there is much originality required in the AI field (especially in computer vision). It's all a whole load of already done and thought-about stuff that needs to be put together and implemented. It's just something you can do for yourself, so you don't have to pay anyone.
But I'm not saying there isn't more to theorize, especially for less artificial vision. This is very artificial, and it has trouble with translucency.
I wonder how good the Hollywood visual effects mocap programmers get their algorithms! I think they must be better than mine; some of the movies are pretty schmick. This 'universal capture' stuff has been around since the Matrix movie with Keanu, but they did it with a camera-room-based system, and I guess these days it's all about getting it to work from just one ordinary camera.