Reducing register ports using delayed write-back queues and operand pre-fetch