MAGIIC-PRO : Experiments and Examples
 

We provide a lot of experiments and examples to demonstrate the capability of WildSpan in identifying flexible and long patterns. All the experiments were conducted on a machine with a 3.4GHz Intel Pentium 4 CPU and memory of 2GBs, running Linux Fedora 4 Server.

 

The table below shows each of dataset and its corresponding description used in this paper. Click the title of query protein to view its mining results.

ID Query Protein Training data set Training data size
1 P36507 | MP2K2_HUMAN | MAPK/ERK kinase 2 PROSITE PS00107 (Protein kinases ATP-binding region signature) 1910
2

Q01698 | EFTU_THEAQ | Elongation factor Tu

PROSITE PS00301 (GTP-binding elongation factors signature )

932
3

P19120 | HSP7C_BOVIN | Heat shock cognate 71 kDa protein

InterPro IPR001023  (Heat shock protein Hsp70)

496
4

P51656 (DHB1_MOUSE) Estradiol 17-beta-dehydrogenase 1

Pfam PF00106 and PROSITE  PS00061 (short chain dehydrogenase)

494
5

P00962 (SYQ_ECOLI) Glutaminyl-tRNA synthetase

Pfam PF00749 (tRNA synthetases class I (E and Q), catalytic domain)

346
6

Q01574 (ACS1_YEAST) Acetyl-coenzyme A synthetase 1

PROSITE PS00455 (Putative AMP-binding domain signature)

282
7

P10933 (FENR1_PEA ) Ferredoxin--NADP reductase

InterPro IPR001433 (Oxidoreductase FAD/NAD(P)-binding)

280
8

P08622 (DNAJ_ECOLI) Chaperone protein dnaJ

Pfam PF00684 and PROSITE PS00637 (dnaJ central domain signature)

275
9

P25910 (BLAB_BACFR) Beta-lactamase type II precursor

Pfam PF00753 (Metallo-beta-lactamase superfamily)

267
10

P36204 (PGKT_THEMA) Bifunctional PGK/TIM

PROSITE PS00111 (Phosphoglycerate kinase signature)

251
11

P27142 (KAD_BACST) Adenylate kinase

PROSITE PS00113 (Adenylate kinase signature)

243
12

P22887(NDKC_DICDI) NDP kinase

PROSITE PS00469 (Nucleoside diphosphate kinases active site )

237
13

P09372 (GRPE_ECOLI) Protein grpE

PROSITE PS01071 (grpE Protein Signature)

195
14 Q58504(KHSE_METJA) Homoserine kinase PROSITE PS00627 (GHMP kinases putative ATP-binding domain ) 191
15 Q58487 (KIME_METJA) Mevalonate kinase PROSITE PS00627 (GHMP kinases putative ATP-binding domain ) 191
16

P37744(RMLA1_ECOLI)  dTDP- glucose synthase 1

Pfam PF00483 (NTP_transferase; 1.) 188
17

P00817(IPYR_YEAST) Inorganic pyrophosphatase

PROSITE PS00387 (Inorganic pyrophosphatase signature)

181
18

P25971(PYRF_BACSU ) Orotidine 5'-phosphate decarboxylase

Pfam PF00215 (Orotidine 5'-phosphate decarboxylase / HUMPS family)

178
19 Q64520(KGUA_MOUSE) Guanylate kinase Pfam PF00625 (Guanylate kinase) 141
20

P51541 (KARG_LIMPO) Arginine kinase

PROSITE PS00112 (ATP:guanido phosphotransferases active site)

81
21

P03958 (ADA_MOUSE) Adenosine deaminase

PROSITE PS00485 (Adenosine and AMP deaminase signature)

59
22

P23368 (MAOM_HUMAN) NAD-dependent malic enzyme

PROSITE PS00331 (Malic enzymes signature)

43
23 1agr:A | Complex of ALF4-ACTIVATED GI-ALPHA-1 with RGS4 100 homologues found by PSI-BLAST 101
24 1wql:R | GAPETTE 100 homologues found by PSI-BLAST 101
25 1ds6:A |A RAC-RHOGDI complex 100 homologues found by PSI-BLAST 101
26 1dtd:A | The leech and human caboxypeptidase complex 47 homologues found by PSI-BLAST 47
27 1wql:G | Cumene dioxygenas from pseudomonas fluorescens 29 homologues found by PSI-BLAST 30
28 1jtp:L | The degenerate interfaces in antigen-antibody complexes 122 homologues found by PSI-BLAST 123
29 2pcc:A | Yeast CCP complex with compnd  yeast ISO-1-cytochrome C 35 homologues found by PSI-BLAST 36
30 2pcc:B | Yeast CCP complex with compnd  yeast ISO-1-cytochrome C 146 homologues found by PSI-BLAST 147
31 1qla:F | Phosphotransferase 37 homologues found by PSI-BLAST 38
32 1qla:G | Phosphotransferase 107 homologues found by PSI-BLAST 108
33 1tx4:A | RHO/RHOGAP/GDP(DOT)ALF4 complex 42 homologues found by PSI-BLAST 43
34 1tx4:B | RHO/RHOGAP/GDP(DOT)ALF4 complex 150 homologues found by PSI-BLAST 151
* The training data was prepared by collecting proteins in the release 48.3 of UniProtKB/Swiss-Prot with the same cross-references of the query protein to the secondary databases, such as InterPro, Pfam, or PROSITE.
To view the mining results of a dataset, click one of the following result page link or large image link.

  

Large Image(1s9i:A) | Result Page

P36507(MP2K2_HUMAN) | Protein kinases ATP-binding region signature

Query Protein : P36507 (MP2K2_HUMAN) Dual specificity mitogen-activated protein kinase kinase 2 (EC 2.7.1.-) (MAP kinase kinase 2) (MAPKK 2) (ERK activator kinase 2) (MAPK/ERK kinase 2) (MEK2).

Training Dataset : 1910 entries are found to match the cross-reference PS00107, out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> PROSITE PS00107, Protein kinases ATP-binding region signature.

Selected Pattern (support threshold 30%):
Maximum Size Pattern | ID: 186 | Support : 577 | Size : 15
                                                          [GO TOP]

Large Image(1eft) | Result Page

Q01698(EFTU_THEAQ) | GTP-binding elongation factors signature

Query Protein :  Q01698 (EFTU_THEAQ) Elongation factor Tu (EF-Tu).

Training Dataset : 932 entries are found to match the cross-reference PS00107, out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> PROSITE PS00301, GTP-binding elongation factors signature.

Selected Pattern (support threshold 80%):
Maximum Size Pattern | ID: 167 | Support : 813 | Size : 13

                                                          [GO TOP]

Large Image(1hpm) | Result Page

P19120 (HSP7C_BOVIN) | Heat shock protein Hsp70

Query Protein : P19120 (HSP7C_BOVIN) Heat shock cognate 71 kDa protein.

Training Dataset : 932 entries are found to match the cross-reference IPR001023, out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> InterPro IPR001023, Heap shock protein Hsp 70.

Selected Pattern (support threshold 83%):
Maximum Size Pattern | ID: 650 | Support : 411 | Size : 12
                                                          [GO TOP]

Large Image(1a27) | Result Page

P51656 (DHB1_MOUSE) | Short-chain dehydrogenases/reductases

Query Protein : P51656 (DHB1_MOUSE) Estradiol 17-beta-dehydrogenase 1 (EC 1.1.1.62) (17-beta-HSD 1) (17- beta-hydroxysteroid dehydrogenase 1).

Training Dataset : 494 entries are found to match the cross-reference PF00106,PS00061, out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> Pfam : PF00106 short chain dehydrogenase.
> PROSITE : PS00061 Short-chain dehydrogenases/reductases family signature.

Selected Pattern (support threshold 20%):
Maximum Size Pattern | ID: 413 | Support : 135 | Size : 10
                                                          [GO TOP]


Large Image(1euq) | Result Page

P00962 (SYQ_ECOLI) | tRNA synthetases class I (E and Q), catalytic domain

Query Protein : P00962 (SYQ_ECOLI) Glutaminyl-tRNA synthetase.

Training Dataset : 346 entries are found to match the cross-reference PF00749 out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> Pfam PF00749, tRNA synthetases class I (E and Q), catalytic domain.

Selected Pattern (support threshold 40%):
Maximum Size Pattern | ID: 409 | Support : 154 | Size : 16
                                                          [GO TOP]

Large Image(1amu:A) | Result Page

Q01574 (ACS1_YEAST) | Putative AMP-binding domain signature

Query Protein : Q01574 (ACS1_YEAST) ACS1_YEAST Acetyl-coenzyme A synthetase 1 (EC 6.2.1.1) (Acetate--CoA ligase 1) (Acyl-activating enzyme 1).

Training Dataset : 282 entries are found to match the cross-reference PF00455 out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> PROSITE PS00455, Putative AMP-binding domain signature.

Selected Pattern (support threshold 40%):
Maximum Size Pattern | ID: 552 | Support : 117 | Size : 18
                                                          [GO TOP]

(a)Large Image(1qfy:A)

(b)Large Image(1qfy:A)

Result Page

P10933 (FENR1_PEA)|Oxidoreductase FAD/NAD(P)-binding

Query Protein : P10933 (FENR1_PEA) Ferredoxin--NADP reductase, leaf isozyme, chloroplast precursor (EC 1.18.1.2) (FNR).

Training Dataset : 280 entries are found to match the cross-reference IPR001433 out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> InterPro IPR001433, Oxidoreductase FAD/NAD(P)-binding.

Selected Pattern (support threshold 20%):
(a) Maximum Size Pattern | ID: 1265 | Support : 57 | Size : 21
(b) Large Support Pattern | ID: 1854 | Support : 147 | Size : 9
                                                          [GO TOP]

Large Support Image(1exk

Large Size Image(1exk)

Result Page

P08622 (DNAJ_ECOLI)|CXXCXGXG DnaJ Central Domain

Query Protein : P08622 (DNAJ_ECOLI) Chaperone protein dnaJ (Heat shock protein J) (HSP40).

Training Dataset : 275 entries are found to match the cross-reference PF00684,PS00637, out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> Pfam PF00684, DnaJ central domain (4 repeats).
> PROSITE PS00637, CXXCXGXG dnaJ domain signature.

Selected Pattern (support threshold 80%):
Large Size Pattern | ID: 31 | Support : 220 | Size : 16
Large Support Pattern | ID: 49 | Support : 264 | Size : 8

                                                          [GO TOP]

Large Image(1a7t:A) | Result Page

P25910 (BLAB_BACFR)|Metallo-beta-lactamase superfamily

Query Protein : P25910 (BLAB_BACFR) Beta-lactamase type II precursor (EC 3.5.2.6) (Penicillinase) (Cephalosporinase) (Imipenem-cefoxitin hydrolyzing enzyme).

Training Dataset : 267 entries are found to match the cross-reference PF00753, out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> Pfam : PF00753, Metallo-beta-lactamase superfamily.

Selected Pattern (support threshold 90%):
Maximum Size Pattern | ID: 663 | Support : 13 | Size : 23
                                                          [GO TOP]

Large Image(1vpe) | Result Page

P36204 (PGKT_THEMA)|Phosphoglycerate kinase signature

Query Protein : P36204 (PGKT_THEMA) Bifunctional PGK/TIM [Includes: Phosphoglycerate kinase (EC 2.7.2.3); Triosephosphate isomerase (EC 5.3.1.1) (TIM) (Triose-phosphate isomerase)].

Training Dataset : 251 entries are found to match the cross-reference PS00111, out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> PROSITE : PS00111, Phosphoglycerate kinase signature.

Selected Pattern (support threshold 90%):
Maximum Size Pattern | ID: 504 | Support : 226 | Size : 19
                                                          [GO TOP]

Large Image(1zin) | Result Page

P27142 (KAD_BACST)|Adenylate kinase signature

Query Protein : P27142 (KAD_BACST) KAD_BACST Adenylate kinase (EC 2.7.4.3) (ATP-AMP transphosphorylase) (AK).

Training Dataset : 243 entries are found to match the cross-reference PS00113, out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> PROSITE : PS00113, Adenylate kinase signature.

Selected Pattern (support threshold 70%):
Maximum Size Pattern | ID: 764 | Support : 171 | Size : 15
                                                          [GO TOP]

Large Image(1b4s:A) | Result Page

P22887 (NDKC_DICDI)|Nucleoside diphosphate kinases active site

Query Protein : P22887 (NDKC_DICDI) Nucleoside diphosphate kinase, cytosolic (EC 2.7.4.6) (NDK) (NDP kinase).

Training Dataset : 237 entries are found to match the cross-reference PS00469, out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> PROSITE : PS00469, Nucleoside diphosphate kinases active site.

Selected Pattern (support threshold 70%):
Maximum Size Pattern | ID: 96 | Support : 175 | Size : 14
                                                          [GO TOP]

Large Image(1dkg:A) | Result Page

P09372 (GRPE_ECOLI)|grpE Protein Signature

Query Protein : P09372 (GRPE_ECOLI) Protein grpE (HSP-70 cofactor) (Heat shock protein B25.3) (HSP24).

Training Dataset : 195 entries are found to match the cross-reference PS01071, out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> PROSITE PS01071, grpE Protein Signature.

Selected Pattern (support threshold 30%):
Maximum Size Pattern | ID: 3302 | Support : 63 | Size : 17
                                                          [GO TOP]

Large Image(1h73) | Result Page

Q58504 (KHSE_METJA)|GHMP kinases putative ATP-binding domain

Query Protein : Q58504 (KHSE_METJA) Homoserine kinase (EC 2.7.1.39) (HSK) (HK).

Training Dataset : 191 entries are found to match the cross-reference PS00627, out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> PROSITE PS00627, GHMP kinases putative ATP-binding domain.

Selected Pattern (support threshold 30%):
Maximum Size Pattern | ID: 60 | Support : 99 | Size : 16
                                                          [GO TOP]

Large Image(1kkh) | Result Page

Q58487 (KIME_METJA)|GHMP kinases putative ATP-binding domain

Query Protein : Q58487 (KIME_METJA) Mevalonate kinase (EC 2.7.1.36) (MK).

Training Dataset : 191 entries are found to match the cross-reference PS00627, out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> PROSITE PS00627, GHMP kinases putative ATP-binding domain.

Selected Pattern (support threshold 10%):
Maximum Size Pattern | ID: 511 | Support : 19 | Size : 19
                                                          [GO TOP]

Large Image(1h5r:A) | Result Page 

P37744 (RMLA1_ECOLI)|NTP_transferase

Query Protein : P37744 (RMLA1_ECOLI) Glucose-1-phosphate thymidylyltransferase 1 (EC 2.7.7.24) (dTDP- glucose synthase 1) (dTDP-glucose pyrophosphorylase 1) (G1P-TT 1).

Training Dataset : 188 entries are found to match the cross-reference PF00483, out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> Pfam PF00483, NTP_transferase;1.

Selected Pattern (support threshold 40%):
Maximum Size Pattern | ID: 166 | Support : 85 | Size : 18
                                                          [GO TOP]

Large Image(117e:A) | Result Page 

P00817 (IPYR_YEAST)|Inorganic pyrophosphatase signature

Query Protein : P00817 (IPYR_YEAST) Inorganic pyrophosphatase (EC 3.6.1.1) (Pyrophosphate phospho- hydrolase) (PPase).

Training Dataset : 181 entries are found to match the cross-reference PS00387, out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> PROSITE PS00387, Inorganic pyrophosphatase signature.

Selected Pattern (support threshold 70%):
Maximum Size Pattern | ID: 1 | Support : 84 | Size : 10
                                                          [GO TOP]

Large Image(1dbt:AB) | Result Page 

P25971 (PYRF_BACSU) | Orotidine 5'-phosphate decarboxylase

Query Protein : P25971 (PYRF_BACSU) PYRF_BACSU Orotidine 5'-phosphate decarboxylase (EC 4.1.1.23) (OMP decarboxylase) (OMPDCase) (OMPdecase).

Training Dataset : 178 entries are found to match the cross-reference PF00215, out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> Pfam PF00215, Orotidine 5'-phosphate decarboxylase / HUMPS family.

Selected Pattern (support threshold 40%):
Maximum Size Pattern | ID: 226 | Support : 71 | Size : 20
                                                          [GO TOP]

Large Image(1lvg) | Result Page 

Q64520 (KGUA_MOUSE)|Guanylate kinase

Query Protein : Q64520 (KGUA_MOUSE) Guanylate kinase (EC 2.7.4.8) (GMP kinase).

Training Dataset : 141 entries are found to match the cross-reference PF00625, out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> Pfam PF00625, Guanylate kinase.

Selected Pattern (support threshold 40%):
Maximum Size Pattern | ID: 226 | Support : 71 | Size : 20
                                                          [GO TOP]

 

Large Image(1p50) | Result Page

P51541 (KARG_LIMPO)|ATP:guanido phosphotransferases active site

Query Protein : P51541 (KARG_LIMPO) Arginine kinase (EC 2.7.3.3) (AK).

Training Dataset : 81 entries are found to match the cross-reference PS00112, out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> PROSITE PS00112, ATP:guanido phosphotransferases active site.

Selected Pattern (support threshold 90%):
Maximum Size Pattern | ID: 7 | Support : 72 | Size : 18
                                                          [GO TOP]

 

Large Image(1fkw) | Result Page

P03958 (ADA_MOUSE)|Adenosine and AMP deaminase signature

Query Protein : P03958 (KARG_LIMPO) Adenosine deaminase (EC 3.5.4.4) (Adenosine aminohydrolase).

Training Dataset : 59 entries are found to match the cross-reference PS00485, out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> PROSITE PS00485, Adenosine and AMP deaminase signature.

Selected Pattern (support threshold 90%):
Maximum Size Pattern | ID: 2 | Support : 41 | Size : 10
                                                          [GO TOP]

Large Image(1do8:A) | Result Page

P23368 (MAOM_HUMAN)|Malic enzymes signature

Query Protein : P23368 (MAOM_HUMAN) NAD-dependent malic enzyme, mitochondrial precursor (EC 1.1.1.38) (NAD-ME) (Malic enzyme 2).

Training Dataset : 43 entries are found to match the cross-reference PS00331, out of 196277 entries in the release 48.3 of UniProtKB/Swiss-Prot.
> PROSITE PS00331, Malic enzymes signature.

Selected Pattern (support threshold 80%):
Maximum Size Pattern | ID: 64 | Support : 34 | Size : 18
                                                          [GO TOP]

Large Image(1agr:A) | Result Page

1arg:A | Complex of ALF4-ACTIVATED GI-ALPHA-1 with RGS4

Query Protein : larg:A sequence |COMPLEX OF ALF4-ACTIVATED GI-ALPHA-1 WITH RGS4 | P10824 Guanine nucleotide-binding protein G(i).

Training Dataset : PSI-BLAST found 241 hits. The mining process will be executed on the following unique 100 protein sequences under the E-value [0.01] and identity [10%~99%] constraints with 48% average identity.

Selected Pattern (support threshold 100%):
Maximum Size Pattern | ID: 1 | Support : 101 | Size : 32
                                                          [GO TOP]

Large Image(1wql:G) | Result Page

1wql:G | Cumene dioxygenas from Pseudomonas fluorescens IP01

Query Protein : lwql:G sequence | Cumene dioxygenase (cumA1A2) from Pseudomonas fluorescens IP01 | P20936 Ras GTPase-activating protein 1.

Training Dataset : PSI-BLAST found 30 hits. The mining process will be executed on the following unique 29 protein sequences under the E-value [0.01] and identity [10%~99%] constraints with 31% average identity.

Selected Pattern (support threshold 40%):
Maximum Size Pattern | ID: 158 | Support : 12 | Size : 20
                                                          [GO TOP]

Large Image(1ds6:A) | Result Page

1ds6:A |A RAC-RHOGDI complex

Query Protein : lds6:A sequence | CRYSTAL STRUCTURE OF A RAC-RHOGDI COMPLEX | P15153 Ras-related C3 botulinum toxin substrate 2.

Training Dataset : PSI-BLAST found 250 hits. The mining process will be executed on the following unique 100 protein sequences under the E-value [0.01] and identity [10%~99%] constraints with 42% average identity.

Selected Pattern (support threshold 90%):
Maximum Size Pattern | ID: 231 | Support : 90 | Size : 13
                                                          [GO TOP]

 

Large Image(1dtd:A) | Result Page

1dtd: A | The leech and human carboxypeptidase complex

Query Protein : 1dtd:A Sequence | CRYSTAL STRUCTURE OF THE COMPLEX BETWEEN THE LEECH CARBOXYPEPTIDASE INHIBITOR AND THE HUMAN CARBOXYPEPTIDASE A2 (LCI-CPA2) | P48052 Carboxypeptidase A2 precursor.

Training Dataset : PSI-BLAST found 53 hits. The mining process will be executed on the following unique 47 protein sequences under the E-value [0.01] and identity [10%~99%] constraints with 39% average identity.

Selected Pattern (support threshold 60%):
Maximum Size Pattern | ID: 25 | Support : 28 | Size : 34
                                                          [GO TOP]

Large Image(1wql:R) | Result Page

1wql:R | GAPETTE

Query Protein : lwql:R sequence | GAPETTE.

Training Dataset : PSI-BLAST found 30 hits. The mining process will be executed on the following unique 100 protein sequences under the E-value [0.01] and identity [10%~99%] constraints with 46% average identity.

Selected Pattern (support threshold 85%):
Maximum Size Pattern | ID: 613 | Support : 86 | Size : 18

                                                          [GO TOP]

Large Image(1jtp:L) | PDB View

1jtp:L| The degenerate interfaces in antigen-antibody complexes

Query Protein : 1JTP:L | Lysozyme C.

Training Dataset : PSI-BLAST found 123 hits. The mining process executed on the unique 122 protein sequences under the E-value [0.01] and identity [10%~99%] constraints with 55% average identity.

Resultant Link (support threshold 70%):
http://biominer.cse.yzu.edu.tw/temp/Uym8dsT50XYZOz50/results.html

Selected Pattern (Maximum Size Pattern):
>> ID:237 | Support:87 | Hits:88 | Size:16 | Blocks:5


 

 

Image Under Construction

Large Image(2pcc:A) | PDB View

2pcc:A| Yeast cytochrome C peroxidase (CCP) complex with compnd  yeast ISO-1-cytochrome C.

Query Protein : 2PCC:A | Yeast cytochrome C peroxidase (CCP)

Training Dataset : PSI-BLAST found 96 hits. The mining process executed on the unique 35 protein sequences under the E-value [0.01] and identity [30%~99%] constraints with 43% average identity.

Resultant Link (support threshold 80%):
http://biominer.cse.yzu.edu.tw/temp/RodmatB28veUHy28/results.html

Selected Pattern (Maximum Size Pattern):
>> ID:55 | Support:28 | Hits:28 | Size:24 | Blocks:6


 

 

Image Under Construction

Large Image(2pcc:B) | PDB View

2pcc:B| Yeast cytochrome C peroxidase (CCP) complex with compnd  yeast ISO-1-cytochrome C.

Query Protein : 2PCC:B | Yeast ISO-1-cytochrome C.

Training Dataset : PSI-BLAST found 149 hits. The mining process executed on the unique 146 protein sequences under the E-value [0.01] and identity [30%~99%] constraints with 58% average identity.

Resultant Link (support threshold 79%):
http://biominer.cse.yzu.edu.tw/temp/58epTSl14vkV7A14/results.html

Selected Pattern (Maximum Size Pattern):
>> ID:1222 | Support:116 | Hits:116 | Size:16 | Blocks:3

 

 

 

Image Under Construction

Large Image(1gla:F) | PDB View

1qla:F| Phosphotransferase, glycerol kinase complex with glycerol and the (escherichia coli) glucose-specific factor III (III-GLC)

Query Protein : 1gla:F

Training Dataset : PSI-BLAST found 39 hits. The mining process executed on the unique 37 protein sequences under the E-value [0.01] and identity [10%~99%] constraints with 45% average identity.

Resultant Link (support threshold 67%):
http://biominer.cse.yzu.edu.tw/temp/Q7mEEV025orqZe25/results.html

Selected Pattern (Maximum Size Pattern):
>> ID:56 | Support:25 | Hits:25 | Size:14 | Blocks:3

 

 

 

Image Under Construction

Large Image(1gla:G) | PDB View

1qla:G| Phosphotransferase, glycerol kinase complex with glycerol and the (escherichia coli) glucose-specific factor III (III-GLC)

Query Protein : 1gla:G

Training Dataset : PSI-BLAST found 141 hits. The mining process executed on the unique 107 protein sequences under the E-value [0.01] and identity [10%~99%] constraints with 58% average identity.

Resultant Link (support threshold 98%):
http://biominer.cse.yzu.edu.tw/temp/3UOvquB41CKB0641/results.html

Selected Pattern (Maximum Size Pattern):
>> ID:30 | Support:105 | Hits:107 | Size:23 | Blocks:5

 

 

 

Image Under Construction

Large Image(1tx4:A) | PDB View

1tx4:A| RHO/RHOGAP/GDP(DOT)ALF4 complex

Query Protein : 1tx4:A | P50-RHOGAP.

Training Dataset : PSI-BLAST found 93 hits. The mining process executed on the unique 42 protein sequences under the E-value [0.01] and identity [10%~99%] constraints with 31% average identity.

Resultant Link (support threshold 41%):
http://biominer.cse.yzu.edu.tw/temp/SJnFLqd35EMb0C35/results.html

Selected Pattern (Maximum Size Pattern):
>> ID:515 | Support:17 | Hits:17 | Size:14 | Blocks:4

 

 

 

Image Under Construction

Large Image(1tx4:B) | PDB View

1tx4:B | RHO/RHOGAP/GDP(DOT)ALF4 complex

Query Protein : 1tx4:B | Transforming protein RHOA.

Training Dataset : PSI-BLAST found 250 hits. The mining process executed on the unique 150 protein sequences under the E-value [0.01] and identity [10%~99%] constraints with 43% average identity.

Resultant Link (support threshold 66%):
http://biominer.cse.yzu.edu.tw/temp/mUMPjjX55zFd1b55/results.html

Selected Pattern (Maximum Size Pattern):
>> ID:2681 | Support:99 | Hits:99 | Size:17 | Blocks:4

Any comments can be referred to Chien-Yu Chen or Chen-Ming Hsu.