[PATCH] fbuffer: improve toggle cursor performance

Greg Kurz gkurz at linux.vnet.ibm.com
Wed May 27 19:24:48 AEST 2015


On Wed, 27 May 2015 07:59:34 +0200
Thomas Huth <thuth at redhat.com> wrote:

> On Wed, 27 May 2015 02:11:13 +0200
> Greg Kurz <gkurz at linux.vnet.ibm.com> wrote:
> 
> > SLOF currently calls hv-logical-load and hv-logical-store for every pixel
> > when enabling or disabling the cursor. This is suboptimal when writing one
> > char at a time to the console since terminal-write always toggles the cursor.
> > And this is precisely what grub is doing when the user wants to edit a menu
> > entry... the result is an incredibly slow and barely usable interface.
> > 
> > The inner loop in fb8-toggle-cursor handles a contiguous region: it can be
> > converted to hv-logical-memop. The result is 32 times less hcalls per char
> > and a serious improvement in grub usability.
> 
> Good idea for an optimization!
> 

Heh no big deal... the hardest part was to find that the LOAD/STORE avalanche
was coming from these rb@ and rb! words. SLOF is still a mysterious beast to
me :)

> > Signed-off-by: Greg Kurz <gkurz at linux.vnet.ibm.com>
> > ---
> >  slof/fs/fbuffer.fs |    4 ++--
> >  1 file changed, 2 insertions(+), 2 deletions(-)
> > 
> > diff --git a/slof/fs/fbuffer.fs b/slof/fs/fbuffer.fs
> > index 756f05a..46b59bf 100644
> > --- a/slof/fs/fbuffer.fs
> > +++ b/slof/fs/fbuffer.fs
> > @@ -99,8 +99,8 @@ CREATE bitmap-buffer 400 4 * allot
> >  : fb8-toggle-cursor ( -- )
> >  	line# fb8-line2addr column# fb8-columns2bytes +
> >  	char-height 0 ?DO
> > -		char-width screen-depth * 0 ?DO dup dup rb@ -1 xor swap rb! 1+ LOOP
> > -		screen-width screen-depth * + char-width screen-depth * -
> > +		dup dup 0 char-width screen-depth * 1 hv-logical-memop drop
> > +		screen-width screen-depth * +
> >  	LOOP drop
> >  ;
> 
> If you use hv-logical-memop in this file here, you definitely break
> board-js2x, since this is bare metal and hv-logical-memop is not
> defined there.
> 

Of course, this is common code... I'll remember for next time. :)

> I think you should either move the new function to board-qemu and handle
> it there like it is done for hcall-invert-screen already, or we could
> think of introducing a helper function that is defined by each board
> which does the xor operation on a memory region (that way we could
> maybe also unify hcall-invert-screen and fb8-invert-screen again).
> 

I guess the first proposal is the obvious fix. From there, we can
work out a patchset for the second proposal.

>  Thomas
> 

Cheers.

--
Greg



More information about the Linuxppc-dev mailing list