@FreddyV; Have you tried replacing the first "xchg ah, al" with a "mov ah, al"? It should be slightly faster in theory (if my documentation is correct).
It would be interesting to see the difference in performance (on various machines, with 8088 and 8086 CPUs) so if anyone wants to implement your suggestion in the actual XUB code and test/benchmark it then I might consider adding another "controller" specifically for 8086 machines if there's a significant difference.
It would be interesting to see the difference in performance (on various machines, with 8088 and 8086 CPUs) so if anyone wants to implement your suggestion in the actual XUB code and test/benchmark it then I might consider adding another "controller" specifically for 8086 machines if there's a significant difference.