Fix a bug in thread local reloations
There was a bug (also uncovered via binutils) where R_X86_64_DTPOFF64 is
set uncorrectly. This program is the minimal reproducer of the seg fault
  #include <errno.h>
  int main() {
        int oerrno = errno;
  }
But it works after the bug fix.