On 8/9/21 02:39, William Smith wrote:
> Personally I want my stuff to be _able_ to operate independently of
> the Internet, though I'm happy to have it around for normal operation,
> and to detect (for a recent instance) things like my external GPS
> antenna on my local Pi4-based NTP server failing.
Right. I feel the same way about repeater linking. It's fun to link ham
repeaters over the Internet, but if we're at all serious about the
EmComm aspect of ham radio we should at least be aware of how dependent
we are on it. One of my absolute favorite Prof. Andrew Tanenbaum
quotes: "Distributed computing is when a computer you didn't even
realize you were using is keeping you from getting any work done."
> Would it make any sense to try the decoding over several settings for
> the decoding limit, or is that too meta?
No. A Fano decoder explores (part of) a binary tree representing all
possible messages. WSPR messages are 50 bits so there are 2^50 possible
messages -- many more than you could ever fully explore even with
today's computers, so it can only search a small subset of paths through
the tree. It starts at the root (the beginning of the message) and,
using the noisy received symbols, chooses which branch -- message bit 0
or 1 -- looks like the "better" one. It continues down that branch,
decoding additional bits, as long as it seems to be on the right path.
But if it has made a wrong turn, neither choice will match what it's
getting; nothing will "make sense". So it backs up and explores down
another path. If that doesn't work, it backs up even more and repeats
the process.
When things go well (the SNR is high) it rarely if ever makes a wrong
turn, so it zips right through. The average number of decoder "moves"
per decoded bit is 1 or only slightly more. But when the signal is
noisy, it will spend a lot of time running into dead ends, repeatedly
backing out and exploring alternate paths. If the signal is *very*
noisy, it may never make it through before it exceeds some preset limit
on how many total moves are allowed. That's the decoding limit you set
at the start. If you try several times with successively higher decoding
limits, you're just wasting the CPU time you spent on the earlier
aborted attempts. You might as well just pick a large decoding limit to
start with.
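To make that concrete, here's a toy sketch (my own illustration, not wsprd's code) of a sequential decoder for a hypothetical rate-1/2, constraint-length-3 convolutional code. A real Fano decoder maintains a running metric and a moving threshold; this sketch substitutes a simple error budget, but it shows the same behavior: a clean signal decodes in one move per bit, a noisy one forces backtracking, and a move limit caps the total effort.

```python
def encode_bit(bit, state):
    """Rate-1/2 encoder: two output symbols per input bit
    (generator polynomials 7 and 5 octal)."""
    s = ((state << 1) | bit) & 0b111
    out = ((s ^ (s >> 1) ^ (s >> 2)) & 1,  # taps 111
           (s ^ (s >> 2)) & 1)             # taps 101
    return out, s

def encode(bits):
    """Encode a message, two channel symbols per bit."""
    state, syms = 0, []
    for b in bits:
        out, state = encode_bit(b, state)
        syms.extend(out)
    return syms

def decode(received, nbits, limit, max_errors=2):
    """Depth-first search over the message tree with a cap ('limit') on
    total decoder moves.  Returns (bits, moves), with bits = None if the
    limit was exceeded or no path fit within the error budget."""
    moves = 0
    def search(depth, state, path, dist):
        nonlocal moves
        if dist > max_errors:      # nothing "makes sense" here: back up
            return None
        if depth == nbits:
            return path
        r = received[2 * depth:2 * depth + 2]
        options = []
        for bit in (0, 1):
            out, s = encode_bit(bit, state)
            options.append(((out[0] != r[0]) + (out[1] != r[1]), bit, s))
        options.sort()             # explore the better-matching branch first
        for d, bit, s in options:
            moves += 1
            if moves > limit:
                return None
            found = search(depth + 1, s, path + [bit], dist + d)
            if found is not None:
                return found
        return None
    return search(0, 0, [], 0), moves
```

Running it on a clean copy of an encoded message takes exactly one move per bit; flip one received symbol and the move count jumps as the decoder runs into dead ends and backs out.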
If you're concerned about spending too much CPU time in a difficult
decode and holding up other messages that may decode more easily, the
answer here is to run each decoder in its own thread and let the
operating system handle the scheduling. You'll still want to set a limit
on the decoding effort just to avoid false decodes.
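A sketch of that scheduling idea, using a thread pool (names here are hypothetical; 'try_decode' stands in for a real decoder that gives up when its effort limit is exceeded):

```python
from concurrent.futures import ThreadPoolExecutor, as_completed

def try_decode(candidate, limit):
    """Placeholder decoder: pretend harder candidates need more 'moves'
    and return None when the effort limit is exceeded."""
    effort = candidate["difficulty"]
    if effort > limit:
        return None
    return candidate["name"], effort

def decode_all(candidates, limit=1000):
    """Run each candidate decode in its own worker thread; collect
    whatever succeeds, in whatever order it finishes."""
    results = []
    with ThreadPoolExecutor() as pool:
        futures = [pool.submit(try_decode, c, limit) for c in candidates]
        for f in as_completed(futures):
            r = f.result()
            if r is not None:
                results.append(r)
    return results
```

The easy decodes come back promptly even if a hard one is still grinding away toward its limit.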
Sequential decoding (of which the Fano and stack algorithms are two
versions) is very much like solving a Sudoku puzzle. With the easy
puzzles it is always fairly obvious what number goes next in what
square, and you rarely find yourself in a dead end that requires you to
back up, erase earlier guess(es) and try others. The really hard puzzles
are designed to make you do that a lot. (Personally, I found solving
Sudoku puzzles by hand awfully tedious, so I wrote a program to solve
them using basically this same algorithm. It works in milliseconds even
on the "monster" puzzles. My wife thinks that's cheating. "Why? I had to
think hard about how I'd solve *every* Sudoku puzzle ever created or
that ever will be, not just the one in front of me. How is that cheating?")
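A bare-bones version of that backtracking approach (the generic algorithm, not necessarily that same program) fits in a couple dozen lines:

```python
def ok(grid, r, c, n):
    """True if digit n can legally go at (r, c)."""
    if any(grid[r][j] == n for j in range(9)):
        return False
    if any(grid[i][c] == n for i in range(9)):
        return False
    br, bc = 3 * (r // 3), 3 * (c // 3)
    return all(grid[br + i][bc + j] != n
               for i in range(3) for j in range(3))

def solve(grid):
    """Backtracking solver: grid is 9x9, 0 = empty cell.
    Fills grid in place; returns True if a solution was found."""
    for r in range(9):
        for c in range(9):
            if grid[r][c] == 0:
                for n in range(1, 10):
                    if ok(grid, r, c, n):
                        grid[r][c] = n
                        if solve(grid):
                            return True
                        grid[r][c] = 0  # dead end: erase, try another
                return False            # no digit fits: back up
    return True
```

Easy puzzles fall straight through with almost no erasing; hard ones force the same deep backtracking the Fano decoder does on a noisy signal.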
> Or is it actually time spent running over the dataset, and there's no
> way to tell how 'well' the decode worked? Obviously, getting a result
> sooner rather than later gives you a better 'quality metric' (or
> whatever it would be called), so is it worth keeping the data and this
> score and using a running average of the score to set your threshold?
> [Or am I, more likely, talking through my hat and just muddying the
> waters?]
No, these are all very reasonable questions. 'wsprd' includes all this
information in its output files, though they don't seem terribly well
documented. Look at the log file ALL_WSPR.txt. The second-to-last column
is the average number of decoder cycles per bit (truncated down to an
integer; ideally it would be shown as a float) and the last column is
the 'path metric'.
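Pulling those two columns out is straightforward (assuming whitespace-separated columns; the exact layout varies between wsprd versions, and the sample line in the test below is made up purely to show the slicing):

```python
def decoder_stats(line):
    """Return (cycles_per_bit, path_metric) from one ALL_WSPR.txt line,
    taken from the last two whitespace-separated columns as described
    above.  Column layout is an assumption; check your wsprd version."""
    cols = line.split()
    return int(cols[-2]), float(cols[-1])
```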
The 'path metric' is a measure of how closely the noisy received symbols
match what they should be for the decoded message. I.e., it's a measure
of how confident the decoder is in its result. During decoding, the
decoder uses the path metric up to that point to decide its next move;
it keeps moving forward as long as the metric is improving. If it keeps
getting worse it'll back up and try another path until it improves
again. (I wrote this code so long ago I can't even remember if a "good"
metric is positive or negative!)
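The textbook Fano branch metric for a binary symmetric channel illustrates the idea (an illustrative sketch, not wsprd's actual metric tables): each symbol contributes log2(P(r|s)/P(r)) minus the code rate R, so matches nudge the metric up a little while mismatches drive it sharply down -- which is what triggers the decoder to back up.

```python
import math

def fano_metric(match, p=0.05, rate=0.5):
    """Per-symbol Fano metric for a binary symmetric channel with
    crossover probability p and code rate R: log2(P(r|s)/P(r)) - R,
    where P(r) = 1/2 for equally likely symbols."""
    pr_given_s = (1 - p) if match else p
    return math.log2(pr_given_s / 0.5) - rate
```

With p = 0.05 and R = 1/2, a matching symbol earns about +0.43 and a mismatch costs about -3.8, so a run of mismatches quickly stalls the metric.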
Path metrics can be very tricky with sequential decoding because it
implicitly compares paths of different lengths, and this requires
accurate estimates of the signal and (especially) noise levels -- which
are basically what you're trying to figure out! I haven't dug too deeply
into the code yet but I suspect that the SNR values we see from wsprd
are actually computed from the synchronization sequences, not the Fano
decoder metrics.
One of the main reasons that sequential decoding fell out of favor when
Viterbi discovered his algorithm was that the Viterbi algorithm only
compares paths of equal lengths. This makes the Viterbi algorithm much
less sensitive to errors in the SNR estimate, especially when "soft
decision decoding" is used. wsprd is one of relatively few applications
of sequential decoding that use soft decisions. Most sequential
decoding applications I found in the literature seemed to just punt,
using it in "hard decision" mode, which loses about 2 dB in SNR
performance.
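The difference is easy to illustrate with a toy sketch (not wsprd's metrics): a hard decision slices each sample to 0/1 and throws away how confident that slice was, while a soft correlation metric keeps that information.

```python
def hard_metric(received, branch):
    """Hard decision: slice each raw sample to a bit first, then just
    count agreements with the branch symbols."""
    bits = [1 if r > 0 else 0 for r in received]
    return sum(b == s for b, s in zip(bits, branch))

def soft_metric(received, branch):
    """Soft decision: correlate the raw samples against the branch
    symbols (mapped to +1/-1), so a marginal sample counts for less
    than a confident one."""
    return sum(r * (1 if s else -1) for r, s in zip(received, branch))
```

Two sample pairs like [0.9, 0.8] and [0.9, 0.05] get the same hard-decision score against branch (1, 1), but the soft metric correctly rates the marginal pair as a weaker match.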
The Viterbi algorithm also runs at a constant speed no matter how noisy
the input stream is. But Viterbi is limited to short constraint lengths
(less coding gain), and it will gladly decode garbage from pure noise,
so you need another layer of FEC to correct or at least detect
uncorrected errors from the Viterbi decoder. This is usually done by
"concatenating" a Reed-Solomon block code on top of the Viterbi-decoded
convolutional code. This scheme was developed for the Voyager
spacecraft, where it stood as the state of the art until the discovery
of turbo codes in the early 1990s and the rediscovery of LDPC codes a
little later. Both the Voyager concatenated code and the sequentially
decoded convolutional code used in WSPR require about the same per-bit
SNR, roughly 2.5-3 dB.
Phil
_______________________________________________
wsjt-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/wsjt-devel