========================== KERNEL ABIS FOR METAG ARCH ========================== This document describes the Linux ABIs for the metag architecture, and has the following sections: (*) Outline of registers (*) Userland registers (*) Kernel registers (*) System call ABI (*) Calling conventions ==================== OUTLINE OF REGISTERS ==================== The main Meta core registers are arranged in units: UNIT Type DESCRIPTION GP EXT PRIV GLOBAL ======= ======= =============== ======= ======= ======= ======= CT Special Control unit D0 General Data unit 0 0-7 8-15 16-31 16-31 D1 General Data unit 1 0-7 8-15 16-31 16-31 A0 General Address unit 0 0-3 4-7 8-15 8-15 A1 General Address unit 1 0-3 4-7 8-15 8-15 PC Special PC unit 0 1 PORT Special Ports TR Special Trigger unit 0-7 TT Special Trace unit 0-5 FX General FP unit 0-15 GP registers form part of the main context. Extended context registers (EXT) may not be present on all hardware threads and can be context switched if support is enabled and the appropriate bits are set in e.g. the D0.8 register to indicate what extended state to preserve. Global registers are shared between threads and are privilege protected. See arch/metag/include/asm/metag_regs.h for definitions relating to core registers and the fields and bits they contain. See the TRMs for further details about special registers. Several special registers are preserved in the main context, these are the interesting ones: REG (ALIAS) PURPOSE ======================= =============================================== CT.1 (TXMODE) Processor mode bits (particularly for DSP) CT.2 (TXSTATUS) Condition flags and LSM_STEP (MGET/MSET step) CT.3 (TXRPT) Branch repeat counter PC.0 (PC) Program counter Some of the general registers have special purposes in the ABI and therefore have aliases: D0 REG (ALIAS) PURPOSE D1 REG (ALIAS) PURPOSE =============== =============== =============== ======================= D0.0 (D0Re0) 32bit result D1.0 (D1Re0) Top half of 64bit result D0.1 (D0Ar6) Argument 6 D1.1 (D1Ar5) Argument 5 D0.2 (D0Ar4) Argument 4 D1.2 (D1Ar3) Argument 3 D0.3 (D0Ar2) Argument 2 D1.3 (D1Ar1) Argument 1 D0.4 (D0FrT) Frame temp D1.4 (D1RtP) Return pointer D0.5 Call preserved D1.5 Call preserved D0.6 Call preserved D1.6 Call preserved D0.7 Call preserved D1.7 Call preserved A0 REG (ALIAS) PURPOSE A1 REG (ALIAS) PURPOSE =============== =============== =============== ======================= A0.0 (A0StP) Stack pointer A1.0 (A1GbP) Global base pointer A0.1 (A0FrP) Frame pointer A1.1 (A1LbP) Local base pointer A0.2 A1.2 A0.3 A1.3 ================== USERLAND REGISTERS ================== All the general purpose D0, D1, A0, A1 registers are preserved when entering the kernel (including asynchronous events such as interrupts and timer ticks) except the following which have special purposes in the ABI: REGISTERS WHEN STATUS PURPOSE =============== ======= =============== =============================== D0.8 DSP Preserved ECH, determines what extended DSP state to preserve. A0.0 (A0StP) ALWAYS Preserved Stack >= A0StP may be clobbered at any time by the creation of a signal frame. A1.0 (A1GbP) SMP Clobbered Used as temporary for loading kernel stack pointer and saving core context. A0.15 !SMP Protected Stores kernel stack pointer. A1.15 ALWAYS Protected Stores kernel base pointer. On UP A0.15 is used to store the kernel stack pointer for storing the userland context. A0.15 is global between hardware threads though which means it cannot be used on SMP for this purpose. Since no protected local registers are available A1GbP is reserved for use as a temporary to allow a percpu stack pointer to be loaded for storing the rest of the context. ================ KERNEL REGISTERS ================ When in the kernel the following registers have special purposes in the ABI: REGISTERS WHEN STATUS PURPOSE =============== ======= =============== =============================== A0.0 (A0StP) ALWAYS Preserved Stack >= A0StP may be clobbered at any time by the creation of an irq signal frame. A1.0 (A1GbP) ALWAYS Preserved Reserved (kernel base pointer). =============== SYSTEM CALL ABI =============== When a system call is made, the following registers are effective: REGISTERS CALL RETURN =============== ======================= =============================== D0.0 (D0Re0) Return value (or -errno) D1.0 (D1Re0) System call number Clobbered D0.1 (D0Ar6) Syscall arg #6 Preserved D1.1 (D1Ar5) Syscall arg #5 Preserved D0.2 (D0Ar4) Syscall arg #4 Preserved D1.2 (D1Ar3) Syscall arg #3 Preserved D0.3 (D0Ar2) Syscall arg #2 Preserved D1.3 (D1Ar1) Syscall arg #1 Preserved Due to the limited number of argument registers and some system calls with badly aligned 64-bit arguments, 64-bit values are always packed in consecutive arguments, even if this is contrary to the normal calling conventions (where the two halves would go in a matching pair of data registers). For example fadvise64_64 usually has the signature: long sys_fadvise64_64(i32 fd, i64 offs, i64 len, i32 advice); But for metag fadvise64_64 is wrapped so that the 64-bit arguments are packed: long sys_fadvise64_64_metag(i32 fd, i32 offs_lo, i32 offs_hi, i32 len_lo, i32 len_hi, i32 advice) So the arguments are packed in the registers like this: D0 REG (ALIAS) VALUE D1 REG (ALIAS) VALUE =============== =============== =============== ======================= D0.1 (D0Ar6) advice D1.1 (D1Ar5) hi(len) D0.2 (D0Ar4) lo(len) D1.2 (D1Ar3) hi(offs) D0.3 (D0Ar2) lo(offs) D1.3 (D1Ar1) fd =================== CALLING CONVENTIONS =================== These calling conventions apply to both user and kernel code. The stack grows from low addresses to high addresses in the metag ABI. The stack pointer (A0StP) should always point to the next free address on the stack and should at all times be 64-bit aligned. The following registers are effective at the point of a call: REGISTERS CALL RETURN =============== ======================= =============================== D0.0 (D0Re0) 32bit return value D1.0 (D1Re0) Upper half of 64bit return value D0.1 (D0Ar6) 32bit argument #6 Clobbered D1.1 (D1Ar5) 32bit argument #5 Clobbered D0.2 (D0Ar4) 32bit argument #4 Clobbered D1.2 (D1Ar3) 32bit argument #3 Clobbered D0.3 (D0Ar2) 32bit argument #2 Clobbered D1.3 (D1Ar1) 32bit argument #1 Clobbered D0.4 (D0FrT) Clobbered D1.4 (D1RtP) Return pointer Clobbered D{0-1}.{5-7} Preserved A0.0 (A0StP) Stack pointer Preserved A1.0 (A0GbP) Preserved A0.1 (A0FrP) Frame pointer Preserved A1.1 (A0LbP) Preserved A{0-1},{2-3} Clobbered 64-bit arguments are placed in matching pairs of registers (i.e. the same register number in both D0 and D1 units), with the least significant half in D0 and the most significant half in D1, leaving a gap where necessary. Further arguments are stored on the stack in reverse order (earlier arguments at higher addresses): ADDRESS 0 1 2 3 4 5 6 7 =============== ===== ===== ===== ===== ===== ===== ===== ===== A0StP --> A0StP-0x08 32bit argument #8 32bit argument #7 A0StP-0x10 32bit argument #10 32bit argument #9 Function prologues tend to look a bit like this: /* If frame pointer in use, move it to frame temp register so it can be easily pushed onto stack */ MOV D0FrT,A0FrP /* If frame pointer in use, set it to stack pointer */ ADD A0FrP,A0StP,#0 /* Preserve D0FrT, D1RtP, D{0-1}.{5-7} on stack, incrementing A0StP */ MSETL [A0StP++],D0FrT,D0.5,D0.6,D0.7 /* Allocate some stack space for local variables */ ADD A0StP,A0StP,#0x10 At this point the stack would look like this: ADDRESS 0 1 2 3 4 5 6 7 =============== ===== ===== ===== ===== ===== ===== ===== ===== A0StP --> A0StP-0x08 A0StP-0x10 A0StP-0x18 Old D0.7 Old D1.7 A0StP-0x20 Old D0.6 Old D1.6 A0StP-0x28 Old D0.5 Old D1.5 A0FrP --> Old A0FrP (frame ptr) Old D1RtP (return ptr) A0FrP-0x08 32bit argument #8 32bit argument #7 A0FrP-0x10 32bit argument #10 32bit argument #9 Function epilogues tend to differ depending on the use of a frame pointer. An example of a frame pointer epilogue: /* Restore D0FrT, D1RtP, D{0-1}.{5-7} from stack, incrementing A0FrP */ MGETL D0FrT,D0.5,D0.6,D0.7,[A0FrP++] /* Restore stack pointer to where frame pointer was before increment */ SUB A0StP,A0FrP,#0x20 /* Restore frame pointer from frame temp */ MOV A0FrP,D0FrT /* Return to caller via restored return pointer */ MOV PC,D1RtP If the function hasn't touched the frame pointer, MGETL cannot be safely used with A0StP as it always increments and that would expose the stack to clobbering by interrupts (kernel) or signals (user). Therefore it's common to see the MGETL split into separate GETL instructions: /* Restore D0FrT, D1RtP, D{0-1}.{5-7} from stack */ GETL D0FrT,D1RtP,[A0StP+#-0x30] GETL D0.5,D1.5,[A0StP+#-0x28] GETL D0.6,D1.6,[A0StP+#-0x20] GETL D0.7,D1.7,[A0StP+#-0x18] /* Restore stack pointer */ SUB A0StP,A0StP,#0x30 /* Return to caller via restored return pointer */ MOV PC,D1RtP