WalnutLum

joined 2 years ago
[–] WalnutLum@lemmy.ml 5 points 3 weeks ago (1 children)

I'm not sure if the Gutenberg Press had only produced one readable copy for every 100 printed it would have been the literary revolution that it was.

[–] WalnutLum@lemmy.ml 2 points 3 weeks ago* (last edited 3 weeks ago)

I'm not sure if it would work for your situation but you seem to be able to ssh into a server on that network? If so you can run a browser on that computer and tunnel the X session over ssh:

https://www.cyberciti.biz/tips/running-x-window-graphical-application-over-ssh-session.html

Otherwise neko seems neat, I've actually been looking for something for watch parties.

[–] WalnutLum@lemmy.ml 58 points 3 weeks ago (15 children)

The Blog Post from the researcher is a more interesting read.

Important points here about benchmarking:

o3 finds the kerberos authentication vulnerability in the benchmark in 8 of the 100 runs. In another 66 of the runs o3 concludes there is no bug present in the code (false negatives), and the remaining 28 reports are false positives. For comparison, Claude Sonnet 3.7 finds it 3 out of 100 runs and Claude Sonnet 3.5 does not find it in 100 runs.

o3 finds the kerberos authentication vulnerability in 1 out of 100 runs with this larger number of input tokens, so a clear drop in performance, but it does still find it. More interestingly however, in the output from the other runs I found a report for a similar, but novel, vulnerability that I did not previously know about. This vulnerability is also due to a free of sess->user, but this time in the session logoff handler.

I'm not sure if a signal to noise ratio of 1:100 is uh... Great...

[–] WalnutLum@lemmy.ml 1 points 3 weeks ago (1 children)

This would feel a lot less gross if this had been with an open model like deepseek-r1.

[–] WalnutLum@lemmy.ml 8 points 3 weeks ago* (last edited 3 weeks ago)

It's not just helicopters. Commercial satellite imaging is good enough to detect mold and askew shingles (usually more through running the image over multiple angles and finding reflectance differences)

I worked for a company that does large scale construction updates based on SAR and Maxtor reflectance data, it's pretty terrifying how accurate it is.

[–] WalnutLum@lemmy.ml 5 points 3 weeks ago (1 children)

Looking forward to every other country on earth advancing space exploration while America feeds SpaceX more money to blow up endangered bird sanctuaries.

[–] WalnutLum@lemmy.ml 1 points 3 weeks ago

I'm not sure how you're getting wallpaper engine to work on Linux because it's not supported on anything other than windows.

Are you using Wallpaper Engine? If so you are likely going to keep having issues with your screen blanking while you try and use it, as it's not supported on Linux.

[–] WalnutLum@lemmy.ml 1 points 3 weeks ago (1 children)

Yea I think "8-12 launches" is the ideal with the launches being at a steady pace (not taking into account weather, launch problems requiring delays etc.)

[–] WalnutLum@lemmy.ml 22 points 3 weeks ago (5 children)

The article you're commenting on is about EU grocery store honey being fake

[–] WalnutLum@lemmy.ml 5 points 3 weeks ago

beating SLS in just tests

Technically the booster/starship combo has yet to lift the tonnage that SLS already lifted with Artemis I.

It's obvious that it'll be cheaper per ton than SLS but It's still a little early to say what level of cost savings it has until we know how many tons super heavy and starship can actually lift. (The estimate SpaceX has been giving goes down by 50 tons every year)

[–] WalnutLum@lemmy.ml 1 points 3 weeks ago (3 children)

There's that Smarter Every Day video that says something like 8-12 (joke is that with schedule slippage it's more like 20)

[–] WalnutLum@lemmy.ml 30 points 3 weeks ago (3 children)

Best part of this is that they just won the contract for launch services even though they still haven't proven the thing can fly without blowing up yet.

view more: ‹ prev next ›