mirror of
https://github.com/biopython/biopython.git
synced 2025-10-20 21:53:47 +08:00
Sample problematic GCG MSF file from IPD-IMGT/HLA database
https://github.com/ANHIG/IMGTHLA/blob/3300/msf/W_prot.msf as of commit d99d8aca3f01f7431741a998ea5cc2417d53ac9c (26 Oct 2017), i.e. from v3.30.0 of the IMGTHLA dtabase. This file has a discrepancy between the alignment length (99 columns) and four of the sequences (only 93 letters without trailing gap padding). The initial Biopython GCG MSF parser will accept this file (and apply the missing padding) with a warning.
This commit is contained in:
61
Tests/msf/W_prot.msf
Normal file
61
Tests/msf/W_prot.msf
Normal file
@ -0,0 +1,61 @@
|
||||
!!AA_MULTIPLE_ALIGNMENT
|
||||
|
||||
MSF: 99 Type: P Oct 18, 2017 11:35 Check: 0 ..
|
||||
|
||||
Name: W*01:01:01:01 Len: 99 Check: 7236 Weight: 1.00
|
||||
Name: W*01:01:01:02 Len: 99 Check: 7236 Weight: 1.00
|
||||
Name: W*01:01:01:03 Len: 99 Check: 7236 Weight: 1.00
|
||||
Name: W*01:01:01:04 Len: 99 Check: 7236 Weight: 1.00
|
||||
Name: W*01:01:01:05 Len: 99 Check: 7236 Weight: 1.00
|
||||
Name: W*01:01:01:06 Len: 99 Check: 7236 Weight: 1.00
|
||||
Name: W*02:01 Len: 93 Check: 9483 Weight: 1.00
|
||||
Name: W*03:01:01:01 Len: 93 Check: 9974 Weight: 1.00
|
||||
Name: W*03:01:01:02 Len: 93 Check: 9974 Weight: 1.00
|
||||
Name: W*04:01 Len: 93 Check: 9169 Weight: 1.00
|
||||
Name: W*05:01 Len: 99 Check: 7331 Weight: 1.00
|
||||
//
|
||||
|
||||
W*01:01:01:01 GLTPFNGYTA ATWTRTAVSS VGMNIPYHGA SYLVRNQELR SWTAADKAAQ
|
||||
W*01:01:01:02 GLTPFNGYTA ATWTRTAVSS VGMNIPYHGA SYLVRNQELR SWTAADKAAQ
|
||||
W*01:01:01:03 GLTPFNGYTA ATWTRTAVSS VGMNIPYHGA SYLVRNQELR SWTAADKAAQ
|
||||
W*01:01:01:04 GLTPFNGYTA ATWTRTAVSS VGMNIPYHGA SYLVRNQELR SWTAADKAAQ
|
||||
W*01:01:01:05 GLTPFNGYTA ATWTRTAVSS VGMNIPYHGA SYLVRNQELR SWTAADKAAQ
|
||||
W*01:01:01:06 GLTPFNGYTA ATWTRTAVSS VGMNIPYHGA SYLVRNQELR SWTAADKAAQ
|
||||
W*02:01 GLTPSNGYTA ATWTRTAASS VGMNIPYDGA SYLVRNQELR SWTAADKAAQ
|
||||
W*03:01:01:01 GLTPSSGYTA ATWTRTAVSS VGMNIPYHGA SYLVRNQELR SWTAADKAAQ
|
||||
W*03:01:01:02 GLTPSSGYTA ATWTRTAVSS VGMNIPYHGA SYLVRNQELR SWTAADKAAQ
|
||||
W*04:01 GLTPSNGYTA ATWTRTAASS VGMNIPYDGA SYLVRNQELR SWTAADKAAQ
|
||||
W*05:01 GLTPSSGYTA ATWTRTAVSS VGMNIPYHGA SYLVRNQELR SWTAADKAAQ
|
||||
|
||||
W*01:01:01:01 MPWRRNRQSC SKPTCREGGR SGSAKSLRMG RRGCSAQNPK DSHDPPPHL
|
||||
W*01:01:01:02 MPWRRNRQSC SKPTCREGGR SGSAKSLRMG RRGCSAQNPK DSHDPPPHL
|
||||
W*01:01:01:03 MPWRRNRQSC SKPTCREGGR SGSAKSLRMG RRGCSAQNPK DSHDPPPHL
|
||||
W*01:01:01:04 MPWRRNRQSC SKPTCREGGR SGSAKSLRMG RRGCSAQNPK DSHDPPPHL
|
||||
W*01:01:01:05 MPWRRNRQSC SKPTCREGGR SGSAKSLRMG RRGCSAQNPK DSHDPPPHL
|
||||
W*01:01:01:06 MPWRRNRQSC SKPTCREGGR SGSAKSLRMG RRGCSAQNPK DSHDPPPHL
|
||||
W*02:01 MPWRRNMQSC SKPTCREGGR SGSAKSLRMG RRRCTAQNPK RLT
|
||||
W*03:01:01:01 MPWRRNRQSC SKPTCREGGR SGSAKSLRMG RRGCSAQNPK RLT
|
||||
W*03:01:01:02 MPWRRNRQSC SKPTCREGGR SGSAKSLRMG RRGCSAQNPK RLT
|
||||
W*04:01 MPWRRNMQSC SKPTCREGGR SGSAKSLRMG RRGCSAQNPK RLT
|
||||
W*05:01 MPWRRNRQSC SKPTCREGGR SGSAKSLRMG RRGCSAQNPK DSHDPPPHL
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
Reference in New Issue
Block a user