研究者データベース

遠藤 俊徳(エンドウ トシノリ)
情報科学研究院 生命人間情報科学部門 バイオインフォマティクス分野
教授

基本情報

所属

  • 情報科学研究院 生命人間情報科学部門 バイオインフォマティクス分野

職名

  • 教授

学位

  • 博士(理学)(総合研究大学院大学)

ホームページURL

科研費研究者番号

  • 00323692

J-Global ID

研究キーワード

  • 機械学習による機能予測   遺伝子構造・機能   情報生物学   

研究分野

  • ライフサイエンス / 進化生物学
  • ライフサイエンス / 多様性生物学、分類学
  • ライフサイエンス / 機能生物化学
  • ライフサイエンス / ゲノム生物学
  • ナノテク・材料 / 生物分子化学
  • 情報通信 / 生命、健康、医療情報学

職歴

  • 2004年04月 - 現在 北海道大学 大学院情報科学研究科 教授
  • 2017年04月 - 2018年03月 北海道大学 情報学企画委員長
  • 2014年04月 - 2015年03月 北海道大学 大学院情報科学研究科生命人間情報科学 専攻長
  • 2014年04月 - 2015年03月 北海道大学 工学部情報エレクトロニクス 学科長
  • 2008年04月 - 2009年03月 北海道大学 大学院情報科学研究科生命人間情報科学 専攻長
  • 2000年04月 - 2004年03月 東京医科歯科大学 難治疾患研究所 講師
  • 2000年 - 2004年 Lecturer,Tokyo Medical and Dental University
  • 1999年05月 - 2000年03月 理化学研究所 ゲノム科学総合研究センター 研究員
  • 1999年 - 2000年 Researcher,RIKEN GSC
  • 1997年04月 - 1999年04月 日本学術振興会 特別研究員
  • 1997年 - 1999年 Researcher
  • 1996年04月 - 1997年03月 新エネルギー・産業技術開発機構 研究員
  • 1996年 - 1997年 Researcher,NEDO, Researcher

学歴

  • 1993年04月 - 1996年03月   総合研究大学院大学   生命科学研究科   遺伝学専攻
  •         - 1996年   総合研究大学院大学
  • 1990年04月 - 1993年03月   東京大学   大学院理学系研究科   生物化学専攻
  •         - 1993年   東京大学
  •         - 1992年   東京大学
  • 1986年04月 - 1990年03月   東京大学   理学部   生物化学科
  •         - 1990年   東京大学

所属学協会

  • 日本進化学会   日本情報処理学会   日本バイオインフォマティクス学会   生き物文化誌学会   Society for Molecular Biology and Evolution   日本遺伝学会   Japan Society of Bioinformatics   Society of Biosophia Studies   Society for Molecular Biology and Evolution   Japanese Society for Genetics   

研究活動情報

論文

  • Keisuke Ueno, Katsuhiko Mineta, Kimihito Ito, Toshinori Endo
    BMC STRUCTURAL BIOLOGY 12 5  2012年04月 [査読有り][通常論文]
     
    Background: Structural genomics approaches, particularly those solving the 3D structures of many proteins with unknown functions, have increased the desire for structure-based function predictions. However, prediction of enzyme function is difficult because one member of a superfamily may catalyze a different reaction than other members, whereas members of different superfamilies can catalyze the same reaction. In addition, conformational changes, mutations or the absence of a particular catalytic residue can prevent inference of the mechanism by which catalytic residues stabilize and promote the elementary reaction. A major hurdle for alignment-based methods for prediction of function is the absence (despite its importance) of a measure of similarity of the physicochemical properties of catalytic sites. To solve this problem, the physicochemical features radially distributed around catalytic sites should be considered in addition to structural and sequence similarities. Results: We showed that radial distribution functions (RDFs), which are associated with the local structural and physicochemical properties of catalytic active sites, are capable of clustering oxidoreductases and transferases by function. The catalytic sites of these enzymes were also characterized using the RDFs. The RDFs provided a measure of the similarity among the catalytic sites, detecting conformational changes caused by mutation of catalytic residues. Furthermore, the RDFs reinforced the classification of enzyme functions based on conventional sequence and structural alignments. Conclusions: Our results demonstrate that the application of RDFs provides advantages in the functional classification of enzymes by providing information about catalytic sites.
  • Mia Nakachi, Ayako Nakajima, Mamoru Nomura, Kouki Yonezawa, Keisuke Ueno, Toshinori Endo, Kazuo Inaba
    MOLECULAR REPRODUCTION AND DEVELOPMENT 78 7 529 - 549 2011年07月 [査読無し][通常論文]
     
    In this study, we performed extensive proteomic analysis of sperm from the ascidian Ciona intestinalis. Sperm were fractionated into heads and flagella, followed by further separation into Triton X-100-soluble and -insoluble fractions. Proteins from each fraction and whole sperm were separated by isoelectric focusing using two different pH ranges, followed by SDS-PAGE at two different polyacrylamide concentrations. In total, 1,294 protein spots representing 304 non-redundant proteins were identified by mass spectrometry (MALDI-TOF). On comparison of the proteins in each fraction, we were able to identify the proteins specific to different sperm compartments. Further comparison with the testis proteome allowed the pairing of proteins with sperm-specific functions. Together with information on gene expression in developing embryos and adult tissues, these results provide insight into novel cellular and functional aspects of sperm proteins, such as distinct localization of actin isoforms, novel Ca-2 vertical bar-binding proteins in axonemes, localization of testis-specific serine/threonine kinase, and the presence of G-protein coupled signaling and ubiquitin pathway in sperm flagella. Mol. Reprod. Dev. 78: 529-549, 2011. (C) 2011 Wiley-Liss, Inc.
  • Mia Nakachi, Ayako Nakajima, Mamoru Nomura, Kouki Yonezawa, Keisuke Ueno, Toshinori Endo, Kazuo Inaba
    MOLECULAR REPRODUCTION AND DEVELOPMENT 78 7 529 - 549 2011年07月 [査読有り][通常論文]
     
    In this study, we performed extensive proteomic analysis of sperm from the ascidian Ciona intestinalis. Sperm were fractionated into heads and flagella, followed by further separation into Triton X-100-soluble and -insoluble fractions. Proteins from each fraction and whole sperm were separated by isoelectric focusing using two different pH ranges, followed by SDS-PAGE at two different polyacrylamide concentrations. In total, 1,294 protein spots representing 304 non-redundant proteins were identified by mass spectrometry (MALDI-TOF). On comparison of the proteins in each fraction, we were able to identify the proteins specific to different sperm compartments. Further comparison with the testis proteome allowed the pairing of proteins with sperm-specific functions. Together with information on gene expression in developing embryos and adult tissues, these results provide insight into novel cellular and functional aspects of sperm proteins, such as distinct localization of actin isoforms, novel Ca-2 vertical bar-binding proteins in axonemes, localization of testis-specific serine/threonine kinase, and the presence of G-protein coupled signaling and ubiquitin pathway in sperm flagella. Mol. Reprod. Dev. 78: 529-549, 2011. (C) 2011 Wiley-Liss, Inc.
  • Katsuhiko Mineta, Yasuko Yamamoto, Yuji Yamazaki, Hiroo Tanaka, Yukiyo Tada, Kuniaki Saito, Atsushi Tamura, Michihiro Igarashi, Toshinori Endo, Kosei Takeuchi, Sachiko Tsukita
    FEBS LETTERS 585 4 606 - 612 2011年02月 [査読有り][通常論文]
     
    Claudins (Cldn) are essential membrane proteins of tight junctions (TJs), which form the paracellular permselective barrier. They are produced by a multi-gene family of 24 reported members in mouse and human. Based on a comprehensive search combined with phylogenetic analyses, we identified three novel claudins (claudin-25, -26, and -27). Quantitative RT-PCR revealed that the three novel claudins were expressed in a tissue- and/or developmental stage-dependent manner. Claudins-25 and -26, but not claudin-27, were immunofluorescently localized to TJs when exogenously expressed in cultured MDCK and Eph epithelial cell lines. These findings expand the claudin family to include at least 27 members. Structured summary: Claudin-25 and ZO-1 colocalize by fluorescence microscopy (View interaction) ZO-1 and Claudin-26 colocalize by fluorescence microscopy (View interaction) (C) 2011 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
  • Toshinori Endo, Keisuke Ueno, Kouki Yonezawa, Katsuhiko Mineta, Kohji Hotta, Yutaka Satou, Lixy Yamada, Michio Ogasawara, Hiroki Takahashi, Ayako Nakajima, Mia Nakachi, Mamoru Nomura, Junko Yaguchi, Yasunori Sasakura, Chisato Yamasaki, Miho Sera, Akiyasu C. Yoshizawa, Tadashi Imanishi, Hisaaki Taniguchi, Kazuo Inaba
    NUCLEIC ACIDS RESEARCH 39 suppl D807 - D814 2011年01月 [査読無し][通常論文]
     
    The Ciona intestinalis protein database (CIPRO) is an integrated protein database for the tunicate species C. intestinalis. The database is unique in two respects: first, because of its phylogenetic position, Ciona is suitable model for understanding vertebrate evolution; and second, the database includes original large-scale transcriptomic and proteomic data. Ciona intestinalis has also been a favorite of developmental biologists. Therefore, large amounts of data exist on its development and morphology, along with a recent genome sequence and gene expression data. The CIPRO database is aimed at collecting those published data as well as providing unique information from unpublished experimental data, such as 3D expression profiling, 2D-PAGE and mass spectrometry-based large-scale analyses at various developmental stages, curated annotation data and various bioinformatic data, to facilitate research in diverse areas, including developmental, comparative and evolutionary biology. For medical and evolutionary research, homologs in humans and major model organisms are intentionally included. The current database is based on a recently developed KH model containing 36 034 unique sequences, but for higher usability it covers 89 683 all known and predicted proteins from all gene models for this species. Of these sequences, more than 10 000 proteins have been manually annotated. Furthermore, to establish a community-supported protein database, these annotations are open to evaluation by users through the CIPRO website. CIPRO 2.5 is freely accessible at http://cipro.ibio.jp/2.5.
  • Predicted expansion of the claudin multigene family
    FEBS Letter doi:10.1016/j.febslet.2011.01.028  2011年 [査読無し][通常論文]
  • Toshinori Endo, Keisuke Ueno, Kouki Yonezawa, Katsuhiko Mineta, Kohji Hotta, Yutaka Satou, Lixy Yamada, Michio Ogasawara, Hiroki Takahashi, Ayako Nakajima, Mia Nakachi, Mamoru Nomura, Junko Yaguchi, Yasunori Sasakura, Chisato Yamasaki, Miho Sera, Akiyasu C. Yoshizawa, Tadashi Imanishi, Hisaaki Taniguchi, Kazuo Inaba
    NUCLEIC ACIDS RESEARCH 39 D807 - D814 2011年01月 [査読有り][通常論文]
     
    The Ciona intestinalis protein database (CIPRO) is an integrated protein database for the tunicate species C. intestinalis. The database is unique in two respects: first, because of its phylogenetic position, Ciona is suitable model for understanding vertebrate evolution; and second, the database includes original large-scale transcriptomic and proteomic data. Ciona intestinalis has also been a favorite of developmental biologists. Therefore, large amounts of data exist on its development and morphology, along with a recent genome sequence and gene expression data. The CIPRO database is aimed at collecting those published data as well as providing unique information from unpublished experimental data, such as 3D expression profiling, 2D-PAGE and mass spectrometry-based large-scale analyses at various developmental stages, curated annotation data and various bioinformatic data, to facilitate research in diverse areas, including developmental, comparative and evolutionary biology. For medical and evolutionary research, homologs in humans and major model organisms are intentionally included. The current database is based on a recently developed KH model containing 36 034 unique sequences, but for higher usability it covers 89 683 all known and predicted proteins from all gene models for this species. Of these sequences, more than 10 000 proteins have been manually annotated. Furthermore, to establish a community-supported protein database, these annotations are open to evaluation by users through the CIPRO website. CIPRO 2.5 is freely accessible at http://cipro.ibio.jp/2.5.
  • Yutaka Satou, Katsuhiko Mineta, Michio Ogasawara, Yasunori Sasakura, Eiichi Shoguchi, Keisuke Ueno, Lixy Yamada, Jun Matsumoto, Jessica Wasserscheid, Ken Dewar, Graham B. Wiley, Simone L. Macmil, Bruce A. Roe, Robert W. Zeller, Kenneth E. M. Hastings, Patrick Lemaire, Erika Lindquist, Toshinori Endo, Kohji Hotta, Kazuo Inaba
    GENOME BIOLOGY 9 10 R152  2008年 [査読無し][通常論文]
     
    Background: The draft genome sequence of the ascidian Ciona intestinalis, along with associated gene models, has been a valuable research resource. However, recently accumulated expressed sequence tag (EST)/cDNA data have revealed numerous inconsistencies with the gene models due in part to intrinsic limitations in gene prediction programs and in part to the fragmented nature of the assembly. Results: We have prepared a less-fragmented assembly on the basis of scaffold-joining guided by paired-end EST and bacterial artificial chromosome (BAC) sequences, and BAC chromosomal in situ hybridization data. The new assembly (115.2 Mb) is similar in length to the initial assembly (116.7 Mb) but contains 1,272 (approximately 50%) fewer scaffolds. The largest scaffold in the new assembly incorporates 95 initial-assembly scaffolds. In conjunction with the new assembly, we have prepared a greatly improved global gene model set strictly correlated with the extensive currently available EST data. The total gene number (15,254) is similar to that of the initial set (15,582), but the new set includes 3,330 models at genomic sites where none were present in the initial set, and 1,779 models that represent fusions of multiple previously incomplete models. In approximately half, 5'-ends were precisely mapped using 5'-full-length ESTs, an important refinement even in otherwise unchanged models. Conclusion: Using these new resources, we identify a population of non-canonical (non-GT-AG) introns and also find that approximately 20% of Ciona genes reside in operons and that operons contain a high proportion of single-exon genes. Thus, the present dataset provides an opportunity to analyze the Ciona genome much more precisely than ever.
  • Chisato Yamasaki, Katsuhiko Murakami, Yasuyuki Fujii, Yoshiharu Sato, Erimi Harada, Jun-Ichi Takeda, Takayuki Taniya, Ryuichi Sakate, Shingo Kikugawa, Makoto Shimada, Motohiko Tanino, Kanako O. Koyanagi, Roberto A. Barrero, Craig Gough, Hong-Woo Chun, Takuya Habara, Hideki Hanaoka, Yosuke Hayakawa, Phillip B. Hilton, Yayoi Kaneko, Masako Kanno, Yoshihiro Kawahara, Toshiyuki Kawamura, Akihiro Matsuya, Naoki Nagata, Kensaku Nishikata, Akiko Ogura Noda, Shin Nurimoto, Naomi Saichi, Hiroaki Sakai, Ryoko Sanbonmatsu, Rie Shiba, Mami Suzuki, Kazuhiko Takabayashi, Aiko Takahashi, Takuro Tamura, Masayuki Tanaka, Susumu Tanaka, Fusano Todokoro, Kaori Yamaguchi, Naoyuki Yamamoto, Toshihisa Okido, Jun Mashima, Aki Hashizume, Lihua Jin, Kyung-Bum Lee, Yi-Chueh Lin, Asami Nozaki, Katsunaga Sakai, Masahito Tada, Satoru Miyazaki, Takashi Makino, Hajime Ohyanagi, Naoki Osato, Nobuhiko Tanaka, Yoshiyuki Suzuki, Kazuho Ikeo, Naruya Saitou, Hideaki Sugawara, Claire O'Donovan, Tamara Kulikova, Eleanor Whitfield, Brian Halligan, Mary Shimoyama, Simon Twigger, Kei Yura, Kouichi Kimura, Tomohiro Yasuda, Tetsuo Nishikawa, Yutaka Akiyama, Chie Motono, Yuri Mukai, Hideki Nagasaki, Makiko Suwa, Paul Horton, Reiko Kikuno, Osamu Ohara, Doron Lancet, Eric Eveno, Esther Graudens, Sandrine Imbeaud, Marie Anne Debily, Yoshihide Hayashizaki, Clara Amid, Michael Han, Andreas Osanger, Toshinori Endo, Michael A. Thomas, Mika Hirakawa, Wojciech Makalowski, Mitsuteru Nakao, Nam-Soon Kim, Hyang-Sook Yoo, Sandro J. De Souza, Maria de Fatima Bonaldo, Yoshihito Niimura, Vladimir Kuryshev, Ingo Schupp, Stefan Wiemann, Matthew Bellgard, Masafumi Shionyu, Libin Jia, Danielle Thierry-Mieg, Jean Thierry-Mieg, Lukas Wagner, Qinghua Zhang, Mitiko Go, Shinsei Minoshima, Masafumi Ohtsubo, Kousuke Hanada, Peter Tonellato, Takao Isogai, Ji Zhang, Boris Lenhard, Sangsoo Kim, Zhu Chen, Ursula Hinz, Anne Estreicher, Kenta Nakai, Izabela Makalowska, Winston Hide, Nicola Tiffin, Laurens Wilming, Ranajit Chakraborty, Marcelo Bento Soares, Maria Luisa Chiusano, Yutaka Suzuki, Charles Auffray, Yumi Yamaguchi-Kabata, Takeshi Itoh, Teruyoshi Hishiki, Satoshi Fukuchi, Ken Nishikawa, Sumio Sugano, Nobuo Nomura, Yoshio Tateno, Tadashi Imanishi, Takashi Gojobori
    NUCLEIC ACIDS RESEARCH 36 D D793 - D799 2008年01月 [査読無し][通常論文]
     
    Here we report the new features and improvements in our latest release of the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/), a comprehensive annotation resource for human genes and transcripts. H-InvDB, originally developed as an integrated database of the human transcriptome based on extensive annotation of large sets of full-length cDNA (FLcDNA) clones, now provides annotation for 120 558 human mRNAs extracted from the International Nucleotide Sequence Databases (INSD), in addition to 54 978 human FLcDNAs, in the latest release H-InvDB_4.6. We mapped those human transcripts onto the human genome sequences (NCBI build 36.1) and determined 34 699 human gene clusters, which could define 34 057 (98.1%) protein-coding and 642 (1.9%) non-protein-coding loci; 858 (2.5%) transcribed loci overlapped with predicted pseudogenes. For all these transcripts and genes, we provide comprehensive annotation including gene structures, gene functions, alternative splicing variants, functional non-protein-coding RNAs, functional domains, predicted sub cellular localizations, metabolic pathways, predictions of protein 3D structure, mapping of SNPs and microsatellite repeat motifs, co-localization with orphan diseases, gene expression profiles, orthologous genes, proteinprotein interactions (PPI) and annotation for gene families. The current H-InvDB annotation resources consist of two main views: Transcript view and Locus view and eight sub-databases: the DiseaseInfo Viewer, H-ANGEL, the Clustering Viewer, G-integra, the TOPO Viewer, Evola, the PPI view and the Gene family/group.
  • Akihiro Matsuya, Ryuichi Sakate, Yoshihiro Kawahara, Kanako O. Koyanagi, Yoshiharu Sato, Yasuyuki Fujii, Chisato Yamasaki, Takuya Habara, Hajime Nakaoka, Fusano Todokoro, Kaori Yamaguchi, Toshinori Endo, Satoshi Oota, Wojciech Makalowski, Kazuho Ikeo, Yoshiyuki Suzuki, Kousuke Hanada, Katsuyuki Hashimoto, Momoki Hirai, Hisakazu Iwama, Naruya Saitou, Aiko T. Hiraki, Lihua Jin, Yayoi Kaneko, Masako Kanno, Katsuhiko Murakami, Akiko Ogura Noda, Naomi Saichi, Ryoko Sanbonmatsu, Mami Suzuki, Jun-Ichi Takeda, Masayuki Tanaka, Takashi Gojobori, Tadashi Imanishi, Takeshi Itoh
    NUCLEIC ACIDS RESEARCH 36 D787 - D792 2008年01月 [査読有り][通常論文]
     
    Orthologs are genes in different species that evolved from a common ancestral gene by speciation. Currently, with the rapid growth of transcriptome data of various species, more reliable orthology information is prerequisite for further studies. However, detection of orthologs could be erroneous if pairwise distance-based methods, such as reciprocal BLAST searches, are utilized. Thus, as a sub-database of H-InvDB, an integrated database of annotated human genes (http://h-invitational.jp/), we constructed a fully curated database of evolutionary features of human genes, called Evola. In the process of the ortholog detection, computational analysis based on conserved genome synteny and transcript sequence similarity was followed by manual curation by researchers examining phylogenetic trees. In total, 18 968 human genes have orthologs among 11 vertebrates (chimpanzee, mouse, cow, chicken, zebrafish, etc.), either computationally detected or manually curated orthologs. Evola provides amino acid sequence alignments and phylogenetic trees of orthologs and homologs. In 'd(N)/d(S) view', natural selection on genes can be analyzed between human and other species. In 'Locus maps', all transcript variants and their exon/intron structures can be compared among orthologous gene loci. We expect the Evola to serve as a comprehensive and reliable database to be utilized in comparative analyses for obtaining new knowledge about human genes. Evola is available at http://www.h-invitational.jp/evola/.
  • Chisato Yamasaki, Katsuhiko Murakami, Yasuyuki Fujii, Yoshiharu Sato, Erimi Harada, Jun-Ichi Takeda, Takayuki Taniya, Ryuichi Sakate, Shingo Kikugawa, Makoto Shimada, Motohiko Tanino, Kanako O. Koyanagi, Roberto A. Barrero, Craig Gough, Hong-Woo Chun, Takuya Habara, Hideki Hanaoka, Yosuke Hayakawa, Phillip B. Hilton, Yayoi Kaneko, Masako Kanno, Yoshihiro Kawahara, Toshiyuki Kawamura, Akihiro Matsuya, Naoki Nagata, Kensaku Nishikata, Akiko Ogura Noda, Shin Nurimoto, Naomi Saichi, Hiroaki Sakai, Ryoko Sanbonmatsu, Rie Shiba, Mami Suzuki, Kazuhiko Takabayashi, Aiko Takahashi, Takuro Tamura, Masayuki Tanaka, Susumu Tanaka, Fusano Todokoro, Kaori Yamaguchi, Naoyuki Yamamoto, Toshihisa Okido, Jun Mashima, Aki Hashizume, Lihua Jin, Kyung-Bum Lee, Yi-Chueh Lin, Asami Nozaki, Katsunaga Sakai, Masahito Tada, Satoru Miyazaki, Takashi Makino, Hajime Ohyanagi, Naoki Osato, Nobuhiko Tanaka, Yoshiyuki Suzuki, Kazuho Ikeo, Naruya Saitou, Hideaki Sugawara, Claire O'Donovan, Tamara Kulikova, Eleanor Whitfield, Brian Halligan, Mary Shimoyama, Simon Twigger, Kei Yura, Kouichi Kimura, Tomohiro Yasuda, Tetsuo Nishikawa, Yutaka Akiyama, Chie Motono, Yuri Mukai, Hideki Nagasaki, Makiko Suwa, Paul Horton, Reiko Kikuno, Osamu Ohara, Doron Lancet, Eric Eveno, Esther Graudens, Sandrine Imbeaud, Marie Anne Debily, Yoshihide Hayashizaki, Clara Amid, Michael Han, Andreas Osanger, Toshinori Endo, Michael A. Thomas, Mika Hirakawa, Wojciech Makalowski, Mitsuteru Nakao, Nam-Soon Kim, Hyang-Sook Yoo, Sandro J. De Souza, Maria de Fatima Bonaldo, Yoshihito Niimura, Vladimir Kuryshev, Ingo Schupp, Stefan Wiemann, Matthew Bellgard, Masafumi Shionyu, Libin Jia, Danielle Thierry-Mieg, Jean Thierry-Mieg, Lukas Wagner, Qinghua Zhang, Mitiko Go, Shinsei Minoshima, Masafumi Ohtsubo, Kousuke Hanada, Peter Tonellato, Takao Isogai, Ji Zhang, Boris Lenhard, Sangsoo Kim, Zhu Chen, Ursula Hinz, Anne Estreicher, Kenta Nakai, Izabela Makalowska, Winston Hide, Nicola Tiffin, Laurens Wilming, Ranajit Chakraborty, Marcelo Bento Soares, Maria Luisa Chiusano, Yutaka Suzuki, Charles Auffray, Yumi Yamaguchi-Kabata, Takeshi Itoh, Teruyoshi Hishiki, Satoshi Fukuchi, Ken Nishikawa, Sumio Sugano, Nobuo Nomura, Yoshio Tateno, Tadashi Imanishi, Takashi Gojobori
    NUCLEIC ACIDS RESEARCH 36 D793 - D799 2008年01月 [査読有り][通常論文]
     
    Here we report the new features and improvements in our latest release of the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/), a comprehensive annotation resource for human genes and transcripts. H-InvDB, originally developed as an integrated database of the human transcriptome based on extensive annotation of large sets of full-length cDNA (FLcDNA) clones, now provides annotation for 120 558 human mRNAs extracted from the International Nucleotide Sequence Databases (INSD), in addition to 54 978 human FLcDNAs, in the latest release H-InvDB_4.6. We mapped those human transcripts onto the human genome sequences (NCBI build 36.1) and determined 34 699 human gene clusters, which could define 34 057 (98.1%) protein-coding and 642 (1.9%) non-protein-coding loci; 858 (2.5%) transcribed loci overlapped with predicted pseudogenes. For all these transcripts and genes, we provide comprehensive annotation including gene structures, gene functions, alternative splicing variants, functional non-protein-coding RNAs, functional domains, predicted sub cellular localizations, metabolic pathways, predictions of protein 3D structure, mapping of SNPs and microsatellite repeat motifs, co-localization with orphan diseases, gene expression profiles, orthologous genes, proteinprotein interactions (PPI) and annotation for gene families. The current H-InvDB annotation resources consist of two main views: Transcript view and Locus view and eight sub-databases: the DiseaseInfo Viewer, H-ANGEL, the Clustering Viewer, G-integra, the TOPO Viewer, Evola, the PPI view and the Gene family/group.
  • Akihiro Matsuya, Ryuichi Sakate, Yoshihiro Kawahara, Kanako O. Koyanagi, Yoshiharu Sato, Yasuyuki Fujii, Chisato Yamasaki, Takuya Habara, Hajime Nakaoka, Fusano Todokoro, Kaori Yamaguchi, Toshinori Endo, Satoshi Oota, Wojciech Makalowski, Kazuho Ikeo, Yoshiyuki Suzuki, Kousuke Hanada, Katsuyuki Hashimoto, Momoki Hirai, Hisakazu Iwama, Naruya Saitou, Aiko T. Hiraki, Lihua Jin, Yayoi Kaneko, Masako Kanno, Katsuhiko Murakami, Akiko Ogura Noda, Naomi Saichi, Ryoko Sanbonmatsu, Mami Suzuki, Jun-Ichi Takeda, Masayuki Tanaka, Takashi Gojobori, Tadashi Imanishi, Takeshi Itoh
    NUCLEIC ACIDS RESEARCH 36 D787 - D792 2008年01月 [査読有り][通常論文]
     
    Orthologs are genes in different species that evolved from a common ancestral gene by speciation. Currently, with the rapid growth of transcriptome data of various species, more reliable orthology information is prerequisite for further studies. However, detection of orthologs could be erroneous if pairwise distance-based methods, such as reciprocal BLAST searches, are utilized. Thus, as a sub-database of H-InvDB, an integrated database of annotated human genes (http://h-invitational.jp/), we constructed a fully curated database of evolutionary features of human genes, called Evola. In the process of the ortholog detection, computational analysis based on conserved genome synteny and transcript sequence similarity was followed by manual curation by researchers examining phylogenetic trees. In total, 18 968 human genes have orthologs among 11 vertebrates (chimpanzee, mouse, cow, chicken, zebrafish, etc.), either computationally detected or manually curated orthologs. Evola provides amino acid sequence alignments and phylogenetic trees of orthologs and homologs. In 'd(N)/d(S) view', natural selection on genes can be analyzed between human and other species. In 'Locus maps', all transcript variants and their exon/intron structures can be compared among orthologous gene loci. We expect the Evola to serve as a comprehensive and reliable database to be utilized in comparative analyses for obtaining new knowledge about human genes. Evola is available at http://www.h-invitational.jp/evola/.
  • Yutaka Satou, Katsuhiko Mineta, Michio Ogasawara, Yasunori Sasakura, Eiichi Shoguchi, Keisuke Ueno, Lixy Yamada, Jun Matsumoto, Jessica Wasserscheid, Ken Dewar, Graham B. Wiley, Simone L. Macmil, Bruce A. Roe, Robert W. Zeller, Kenneth E. M. Hastings, Patrick Lemaire, Erika Lindquist, Toshinori Endo, Kohji Hotta, Kazuo Inaba
    GENOME BIOLOGY 9 10 R152  2008年 [査読有り][通常論文]
     
    Background: The draft genome sequence of the ascidian Ciona intestinalis, along with associated gene models, has been a valuable research resource. However, recently accumulated expressed sequence tag (EST)/cDNA data have revealed numerous inconsistencies with the gene models due in part to intrinsic limitations in gene prediction programs and in part to the fragmented nature of the assembly. Results: We have prepared a less-fragmented assembly on the basis of scaffold-joining guided by paired-end EST and bacterial artificial chromosome (BAC) sequences, and BAC chromosomal in situ hybridization data. The new assembly (115.2 Mb) is similar in length to the initial assembly (116.7 Mb) but contains 1,272 (approximately 50%) fewer scaffolds. The largest scaffold in the new assembly incorporates 95 initial-assembly scaffolds. In conjunction with the new assembly, we have prepared a greatly improved global gene model set strictly correlated with the extensive currently available EST data. The total gene number (15,254) is similar to that of the initial set (15,582), but the new set includes 3,330 models at genomic sites where none were present in the initial set, and 1,779 models that represent fusions of multiple previously incomplete models. In approximately half, 5'-ends were precisely mapped using 5'-full-length ESTs, an important refinement even in otherwise unchanged models. Conclusion: Using these new resources, we identify a population of non-canonical (non-GT-AG) introns and also find that approximately 20% of Ciona genes reside in operons and that operons contain a high proportion of single-exon genes. Thus, the present dataset provides an opportunity to analyze the Ciona genome much more precisely than ever.
  • T Imanishi, T Itoh, Y Suzuki, C O'Donovan, S Fukuchi, KO Koyanagi, RA Barrero, T Tamura, Y Yamaguchi-Kabata, M Tanino, K Yura, S Miyazaki, K Ikeo, K Homma, A Kasprzyk, T Nishikawa, M Hirakawa, J Thierry-Mieg, D Thierry-Mieg, J Ashurst, LB Jia, M Nakao, MA Thomas, N Mulder, Y Karavidopoulou, LH Jin, S Kim, T Yasuda, B Lenhard, E Eveno, Y Suzuki, C Yamasaki, J Takeda, C Gough, P Hilton, Y Fujii, H Sakai, S Tanaka, C Amid, M Bellgard, MD Bonaldo, H Bono, SK Bromberg, AJ Brookes, E Bruford, P Carninci, C Chelala, C Couillault, SJ de Souza, MA Debily, MD Devignes, Dubchak, I, T Endo, A Estreicher, E Eyras, K Fukami-Kobayash, GR Gopinath, E Graudens, Y Hahn, M Han, ZG Han, K Hanada, H Hanaoka, E Harada, K Hashimoto, U Hinz, M Hirai, T Hishiki, Hopkinson, I, S Imbeaud, H Inoko, A Kanapin, Y Kaneko, T Kasukawa, J Kelso, P Kersey, R Kikuno, K Kimura, B Korn, Kuryshev, V, Makalowska, I, T Makino, S Mano, R Mariage-Samson, J Mashima, H Matsuda, HW Mewes, S Minoshima, K Nagai, H Nagasaki, N Nagata, R Nigam, O Ogasawara, O Ohara, M Ohtsubo, N Okada, T Okido, S Oota, M Ota, T Ota, T Otsuki, D Piatier-Tonneau, A Poustka, SX Ren, N Saitou, K Sakai, S Sakamoto, R Sakate, Schupp, I, F Servant, S Sherry, R Shiba, N Shimizu, M Shimoyama, AJ Simpson, B Soares, C Steward, M Suwa, M Suzuki, A Takahashi, G Tamiya, H Tanaka, T Taylor, JD Terwilliger, P Unneberg, Veeramachaneni, V, S Watanabe, L Wilming, N Yasuda, HS Yoo, M Stodolsky, W Makalowski, M Go, K Nakai, T Takagi, M Kanehisa, Y Sakaki, J Quackenbush, Y Okazaki, Y Hayashizaki, W Hide, R Chakraborty, K Nishikawa, H Sugawara, Y Tateno, Z Chen, M Oishi, P Tonellato, R Apweiler, K Okubo, L Wagner, S Wiemann, RL Strausberg, T Isogai, C Auffray, N Nomura, T Gojobori, S Sugano
    PLOS BIOLOGY 2 6 856 - 875 2004年06月 [査読無し][通常論文]
     
    The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/). It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs), identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly) may contain misassembled or missing regions. We found that 6.5% of the human gene candidates (1,377 loci) did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for nonprotein-coding RNA genes. In addition, among 72,027 uniquely mapped SNPs and insertions/deletions localized within human genes, 13,215 nonsynonymous SNPs, 315 nonsense SNPs, and 452 indels occurred in coding regions. Together with 25 polymorphic microsatellite repeats present in coding regions, they may alter protein structure, causing phenotypic effects or resulting in disease. The H-InvDB platform represents a substantial contribution to resources needed for the exploration of human biology and pathology.
  • T Imanishi, T Itoh, Y Suzuki, C O'Donovan, S Fukuchi, KO Koyanagi, RA Barrero, T Tamura, Y Yamaguchi-Kabata, M Tanino, K Yura, S Miyazaki, K Ikeo, K Homma, A Kasprzyk, T Nishikawa, M Hirakawa, J Thierry-Mieg, D Thierry-Mieg, J Ashurst, LB Jia, M Nakao, MA Thomas, N Mulder, Y Karavidopoulou, LH Jin, S Kim, T Yasuda, B Lenhard, E Eveno, Y Suzuki, C Yamasaki, J Takeda, C Gough, P Hilton, Y Fujii, H Sakai, S Tanaka, C Amid, M Bellgard, MD Bonaldo, H Bono, SK Bromberg, AJ Brookes, E Bruford, P Carninci, C Chelala, C Couillault, SJ de Souza, MA Debily, MD Devignes, Dubchak, I, T Endo, A Estreicher, E Eyras, K Fukami-Kobayash, GR Gopinath, E Graudens, Y Hahn, M Han, ZG Han, K Hanada, H Hanaoka, E Harada, K Hashimoto, U Hinz, M Hirai, T Hishiki, Hopkinson, I, S Imbeaud, H Inoko, A Kanapin, Y Kaneko, T Kasukawa, J Kelso, P Kersey, R Kikuno, K Kimura, B Korn, Kuryshev, V, Makalowska, I, T Makino, S Mano, R Mariage-Samson, J Mashima, H Matsuda, HW Mewes, S Minoshima, K Nagai, H Nagasaki, N Nagata, R Nigam, O Ogasawara, O Ohara, M Ohtsubo, N Okada, T Okido, S Oota, M Ota, T Ota, T Otsuki, D Piatier-Tonneau, A Poustka, SX Ren, N Saitou, K Sakai, S Sakamoto, R Sakate, Schupp, I, F Servant, S Sherry, R Shiba, N Shimizu, M Shimoyama, AJ Simpson, B Soares, C Steward, M Suwa, M Suzuki, A Takahashi, G Tamiya, H Tanaka, T Taylor, JD Terwilliger, P Unneberg, Veeramachaneni, V, S Watanabe, L Wilming, N Yasuda, HS Yoo, M Stodolsky, W Makalowski, M Go, K Nakai, T Takagi, M Kanehisa, Y Sakaki, J Quackenbush, Y Okazaki, Y Hayashizaki, W Hide, R Chakraborty, K Nishikawa, H Sugawara, Y Tateno, Z Chen, M Oishi, P Tonellato, R Apweiler, K Okubo, L Wagner, S Wiemann, RL Strausberg, T Isogai, C Auffray, N Nomura, T Gojobori, S Sugano
    PLOS BIOLOGY 2 6 856 - 875 2004年06月 [査読有り][通常論文]
     
    The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/). It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs), identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly) may contain misassembled or missing regions. We found that 6.5% of the human gene candidates (1,377 loci) did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for nonprotein-coding RNA genes. In addition, among 72,027 uniquely mapped SNPs and insertions/deletions localized within human genes, 13,215 nonsynonymous SNPs, 315 nonsense SNPs, and 452 indels occurred in coding regions. Together with 25 polymorphic microsatellite repeats present in coding regions, they may alter protein structure, causing phenotypic effects or resulting in disease. The H-InvDB platform represents a substantial contribution to resources needed for the exploration of human biology and pathology.
  • Standardized phylogenetic tree: a tool to discover functional evolution
    J. Mol. Evol. 57 S174-181  2003年 [査読無し][通常論文]
  • Standardized phylogenetic tree: a tool to discover functional evolution
    Toshinori Endo, Soichi Ogishima, Hiroshi Tanaka
    J. Mol. Evol. 57 S174 - S181 2003年 [査読有り][通常論文]
  • T Sasaki, T Matsumoto, K Yamamoto, K Sakata, T Baba, Y Katayose, JZ Wu, Y Niimura, ZK Cheng, Y Nagamura, BA Antonio, H Kanamori, S Hosokawa, M Masukawa, K Arikawa, Y Chiden, M Hayashi, M Okamoto, T Ando, H Aoki, K Arita, M Hamada, C Harada, S Hijishita, M Honda, Y Ichikawa, A Idonuma, M Iijima, M Ikeno, S Ito, T Ito, Y Ito, Y Ito, A Iwabuchi, K Kamiya, W Karasawa, S Katagiri, A Kikuta, N Kobayashi, Kono, I, K Machita, T Maehara, H Mizuno, T Mizubayashi, Y Mukai, H Nagasaki, M Nakashima, Y Nakama, Y Nakamichi, M Nakamura, N Namiki, M Negishi, Ohta, I, N Ono, S Saji, K Sakai, M Shibata, T Shimokawa, A Shomura, JY Song, Y Takazaki, K Terasawa, K Tsuji, K Waki, H Yamagata, H Yamane, S Yoshiki, R Yoshihara, K Yukawa, HS Zhong, H Iwama, T Endo, H Ito, JH Hahn, HI Kim, MY Eun, M Yano, JM Jiang, T Gojohori
    NATURE 420 6913 312 - 316 2002年11月 [査読無し][通常論文]
     
    The rice species Oryza sativa is considered to be a model plant because of its small genome size, extensive genetic map, relative ease of transformation and synteny with other cereal crops(1-4). Here we report the essentially complete sequence of chromosome 1, the longest chromosome in the rice genome. We summarize characteristics of the chromosome structure and the biological insight gained from the sequence. The analysis of 43.3 megabases (Mb) of non-overlapping sequence reveals 6,756 protein coding genes, of which 3,161 show homology to proteins of Arabidopsis thaliana, another model plant. About 30% ( 2,073) of the genes have been functionally categorized. Rice chromosome 1 is (G + C)-rich, especially in its coding regions, and is characterized by several gene families that are dispersed or arranged in tandem repeats. Comparison with a draft sequence(5) indicates the importance of a high-quality finished sequence.
  • T Sasaki, T Matsumoto, K Yamamoto, K Sakata, T Baba, Y Katayose, JZ Wu, Y Niimura, ZK Cheng, Y Nagamura, BA Antonio, H Kanamori, S Hosokawa, M Masukawa, K Arikawa, Y Chiden, M Hayashi, M Okamoto, T Ando, H Aoki, K Arita, M Hamada, C Harada, S Hijishita, M Honda, Y Ichikawa, A Idonuma, M Iijima, M Ikeno, S Ito, T Ito, Y Ito, Y Ito, A Iwabuchi, K Kamiya, W Karasawa, S Katagiri, A Kikuta, N Kobayashi, Kono, I, K Machita, T Maehara, H Mizuno, T Mizubayashi, Y Mukai, H Nagasaki, M Nakashima, Y Nakama, Y Nakamichi, M Nakamura, N Namiki, M Negishi, Ohta, I, N Ono, S Saji, K Sakai, M Shibata, T Shimokawa, A Shomura, JY Song, Y Takazaki, K Terasawa, K Tsuji, K Waki, H Yamagata, H Yamane, S Yoshiki, R Yoshihara, K Yukawa, HS Zhong, H Iwama, T Endo, H Ito, JH Hahn, HI Kim, MY Eun, M Yano, JM Jiang, T Gojohori
    NATURE 420 6913 312 - 316 2002年11月 [査読有り][通常論文]
     
    The rice species Oryza sativa is considered to be a model plant because of its small genome size, extensive genetic map, relative ease of transformation and synteny with other cereal crops(1-4). Here we report the essentially complete sequence of chromosome 1, the longest chromosome in the rice genome. We summarize characteristics of the chromosome structure and the biological insight gained from the sequence. The analysis of 43.3 megabases (Mb) of non-overlapping sequence reveals 6,756 protein coding genes, of which 3,161 show homology to proteins of Arabidopsis thaliana, another model plant. About 30% ( 2,073) of the genes have been functionally categorized. Rice chromosome 1 is (G + C)-rich, especially in its coding regions, and is characterized by several gene families that are dispersed or arranged in tandem repeats. Comparison with a draft sequence(5) indicates the importance of a high-quality finished sequence.
  • T Endo, A Fedorov, SJ de Souza, W Gilbert
    MOLECULAR BIOLOGY AND EVOLUTION 19 4 521 - 525 2002年04月 [査読無し][通常論文]
     
    Are intron positions correlated with regions of high amino acid conservation? For a set of ancient conserved proteins, with intronless prokaryotic but intron-containing eukaryotic homologs, multiple sequence alignments identified residues invariant throughout evolution. Intron positions between codons show no preferences. However, introns lying after the first base of a codon prefer conserved regions, markedly in glycines. Because glycines are in excess in conserved regions, this behavior could reflect phase-one introns entering glycine residues randomly in the ancestral sequences. Examination of intron positions within codons of evolutionarily invariable amino acids showed that roughly 50% of these introns are bordered by guanines at both 5'- and 3'-ends, 25% have a G only before the intron, and 5% have a G only after the intron, whereas about 20% are bordered by nonguanine bases.
  • Do introns favor or avoid regions of amino acid conservation?
    Toshinori Endo, Alexei Fedorov, Sandro de Souza, Walter Gilbert
    Mol. Biol. Evol. 19 4 521 - 525 2002年 [査読有り][通常論文]
  • Harukazu Suzuki, Yoshifumi Fukunishi, Ikuko Kagawa, Rintaro Saito, Hiroshi Oda, Toshinori Endo, Shinji Kondo, Hidemasa Bono, Yasushi Okazaki, Yoshihide Hayashizaki
    Genome Research 11 10 1758 - 1765 2001年 [査読無し][通常論文]
     
    We have developed a novel assay system for systematic analysis of protein-protein interactions (PPIs) that is characteristic of a PCR-mediated rapid sample preparation and a high-throughput assay system based on the mammalian two-hybrid method. Using gene-specific primers, we successfully constructed the assay samples by two rounds of PCR with up to 3.6 kb from the first-round PCR fragments. In the assay system, we designed all the steps to be performed by adding only samples, reagents, and cells into 384-well assay plates using two types of semiautomatic multiple dispensers. The system enabled us examine more than 20,000 assay wells per day. We detected 145 interactions in our pilot study using 3500 samples derived from mouse full-length enriched cDNAs. Analysis of the interaction data showed both several significant interaction clusters and predicted functions of a few uncharacterized proteins. In combination with our comprehensive mouse full-length cDNA clone bank covering a large part of the whole genes, our high-throughput assay system will discover many interactions to facilitate understanding of the function of uncharacterized proteins and the molecular mechanism of crucial biological processes, and also enable completion of a rough draft of the entire PPI panel in certain cell types or tissues of mouse within a short time.
  • Y Sugahara, P Carninci, M Itoh, K Shibata, H Konno, T Endo, M Muramatsu, Y Hayashizaki
    GENE 263 1-2 93 - 102 2001年01月 [査読無し][通常論文]
     
    To enhance the usefulness of the laboratory mouse and to facilitate the rapid assay of gene functions we have been collecting the entire set of mouse full-length cDNA by one-pass sequencing. To collect full-length cDNA clones efficiently, it is critical to construct high-quality cDNA libraries. In recent years, we have been developing a way to construct full-length cDNA libraries by using biotinylation of the cap structure (the 'CAP-trapper' method) coupled with treatment to increase reverse transcriptase efficiency at high temperature by the addition of trehalose. In this paper we report our evaluation of the quality of CAP trapper and a number of other full-length cDNA libraries, including the results of 5' end analysis of clones in CAP trapper and the other Libraries. We used a procedure that compared the 5'-ends of cDNA clones with those of genes in the public databases. Our analysis showed that 63% of cDNA clones in CAP trapper libraries had sequences that were either the same length as those of equivalent genes in the public database or 5'-extended, and that 90% of these clones maintained their coding sequences. These results indicate that the CAP trapper Library is a promising tool for collecting full-length cDNA in large-scale projects. Comparison of the quality of CAP trapper with that of other full-length-cDNA libraries confirmed the value of these libraries. (C) 2001 Elsevier Science B.V. All rights reserved.
  • Harukazu Suzuki, Yoshifumi Fukunishi, Ikuko Kagawa, Rintaro Saito, Hiroshi Oda, Toshinori Endo, Shinji Kondo, Hidemasa Bono, Yasushi Okazaki, Yoshihide Hayashizaki
    Genome Research 11 10 1758 - 1765 2001年 [査読有り][通常論文]
     
    We have developed a novel assay system for systematic analysis of protein-protein interactions (PPIs) that is characteristic of a PCR-mediated rapid sample preparation and a high-throughput assay system based on the mammalian two-hybrid method. Using gene-specific primers, we successfully constructed the assay samples by two rounds of PCR with up to 3.6 kb from the first-round PCR fragments. In the assay system, we designed all the steps to be performed by adding only samples, reagents, and cells into 384-well assay plates using two types of semiautomatic multiple dispensers. The system enabled us examine more than 20,000 assay wells per day. We detected 145 interactions in our pilot study using 3500 samples derived from mouse full-length enriched cDNAs. Analysis of the interaction data showed both several significant interaction clusters and predicted functions of a few uncharacterized proteins. In combination with our comprehensive mouse full-length cDNA clone bank covering a large part of the whole genes, our high-throughput assay system will discover many interactions to facilitate understanding of the function of uncharacterized proteins and the molecular mechanism of crucial biological processes, and also enable completion of a rough draft of the entire PPI panel in certain cell types or tissues of mouse within a short time.
  • Y Sugahara, P Carninci, M Itoh, K Shibata, H Konno, T Endo, M Muramatsu, Y Hayashizaki
    GENE 263 1-2 93 - 102 2001年01月 [査読有り][通常論文]
     
    To enhance the usefulness of the laboratory mouse and to facilitate the rapid assay of gene functions we have been collecting the entire set of mouse full-length cDNA by one-pass sequencing. To collect full-length cDNA clones efficiently, it is critical to construct high-quality cDNA libraries. In recent years, we have been developing a way to construct full-length cDNA libraries by using biotinylation of the cap structure (the 'CAP-trapper' method) coupled with treatment to increase reverse transcriptase efficiency at high temperature by the addition of trehalose. In this paper we report our evaluation of the quality of CAP trapper and a number of other full-length cDNA libraries, including the results of 5' end analysis of clones in CAP trapper and the other Libraries. We used a procedure that compared the 5'-ends of cDNA clones with those of genes in the public databases. Our analysis showed that 63% of cDNA clones in CAP trapper libraries had sequences that were either the same length as those of equivalent genes in the public database or 5'-extended, and that 90% of these clones maintained their coding sequences. These results indicate that the CAP trapper Library is a promising tool for collecting full-length cDNA in large-scale projects. Comparison of the quality of CAP trapper with that of other full-length-cDNA libraries confirmed the value of these libraries. (C) 2001 Elsevier Science B.V. All rights reserved.
  • Takeshi Ishimizu, Toshinori Endo, Yumi Yamaguchi-Kabata, Kazuo T. Nakamura, Fumio Sakiyama, Shigemi Norioka
    FEBS Letters 440 3 337 - 342 1998年12月04日 [査読有り][通常論文]
     
    A stylar S-RNase is associated with gametophytic self-incompatibility in the Rosaceae, Solanaceae, and Scrophulariaceae. This S-RNase is responsible for S-allele-specific recognition in the self-incompatible reaction, but how it functions in specific discrimination is not clear. Window analysis of the numbers of synonymous (d(S)) and non-synonymous (d(N)) substitutions in rosaceous S-RNases detected four regions with an excess of d(N) over d(S) in which positive selection may operate (PS regions). The topology of the secondary structure of the S-RNases predicted by the PHD method is very similar to that of fungal RNase Rh whose tertiary structure is known. When the sequences of S-RNases are aligned with the sequence of RNase Rh based on the predicted secondary structures, the four PS regions correspond to two surface sites on the tertiary structure of RNase Rh. These findings suggest that in S-RNases the PS regions also form two sites and are candidates for the recognition sites for S-allele-specific discrimination. Copyright (C) 1998 Federation of European Biochemical Societies.
  • *Evolutionary significance of intra-genome duplications on human chromosomes.
    Toshinori Endo, Tadashi Imanishi, Takashi Gojobori, Hidetoshi Inoko
    Gene 210 2 351 - 354 1998年04月 [査読有り][通常論文]
  • Toshinori Endo, Tadashi Imanishi, Takashi Gojobori, Hidetoshi Inoko
    GENE 205 1/2 19 - 27 1997年12月 [査読有り][通常論文]
  • Yoshio Tateno, Kazuho Ikeo, Hidemi Watanabe, Toshinori Endo, Yumi Yamaguchi, Yoshiyuki Suzuki, Kazunari Takahashi, Kazuhisa Tsunoyama, Masahiko Kawai, Yuichi Kawanishi, Kimihiko. Naito, Takashi Gojobori
    J. Mol Evol. 44(Suppl 1) S38 - S43 1997年01月 [査読有り][通常論文]
  • Akishinonomiya Fumihito, Tetsuo Miyake, Masaru Takada, Ryosuke Shingu, Toshinori Endo, Takashi Gojobori, Norio Kondo, Susumu Ohno
    Proc. of Natl. Acad. Sci., USA. 93 13 6792 - 6795 1996年06月 [査読有り][通常論文]
  • T Endo, K Ikeo, T Gojobori
    MOLECULAR BIOLOGY AND EVOLUTION 13 5 685 - 690 1996年05月 [査読有り][通常論文]
     
    We conducted a systematic search for the candidate genes on which positive selection may operate, on the premise that for such genes the number of nonsynonymous substitution is expected to be larger than that of synonymous substitutions when the nucleotide sequences of the genes under investigation are compared with each other. By obtaining 3,595 groups of homologous sequences from the DDBJ, EMBL, and GenBank DNA sequence databases, we found that 17 gene groups can be the candidates for the genes on which positive selection may operate. Thus, such genes are found to occupy only about 0.5% of the vast number of gene groups so far available. Interestingly enough, 9 out of the 17 gene groups were the surface antigens of parasites or viruses.
  • Large-scale search for genes on which positive selection may operate.
    Toshinori Endo
    Dessertation. Graduate School for Advanced Studies 1995年 [査読有り][通常論文]
  • VC FOLETTA, MH SONOBE, T SUZUKI, T ENDO, H IBA, DR COHEN
    ONCOGENE 9 11 3305 - 3311 1994年11月 [査読有り][通常論文]
     
    Transcription factor AP-1 is comprised of multiple protein complexes that include members of a family of genes related to the proto-oncogene c-fos. In this report, we have extended the analysis of one member of this family, fos-related antigen-2 (fra-2), by isolating and characterising genomic and cDNA clones encoding:the mouse fra-2 homolog. The overall gene structure (number and positions of introns) was similar to that of both the chicken fra-2 gene and other members of the fos family, and the relative positions of putative enhancers in the 5' regulatory region were well conserved between the mouse and chicken fra-2 genes. High levels of fra-2 mRNA were detected in ovary, stomach, small and large intestine, brain, lung and heart. The mouse Fra-2 protein showed 94% and 87.5% conservation with human and chicken Fra-2, respectively, and mouse Fra-2, like the chicken homolog, induced transformation of chicken embryo fibroblasts. The characterisation of the mouse fra-2 gene provides a basis for analysis of Fra-2 function in the whole animal.
  • Takashi Kameda, Atsuko Akahori, Martha H. Sonobe, Takehisa Suzuki, Toshinori Endo, Hideo Iba
    Proc. Natl Acad of Sci., U.S.A. 90 20 9369 - 9373 1993年10月 [査読有り][通常論文]
  • T SUZUKI, H OKUNO, T YOSHIDA, T ENDO, H NISHINA, H IBA
    NUCLEIC ACIDS RESEARCH 19 20 5537 - 5542 1991年10月 [査読有り][通常論文]
     
    Fra-2, one of the Fos-related antigens, is promptly expressed after the growth stimulation of fibroblasts, but its induction peak is later than that of c-Fos. In this report, we examined biochemical properties of Fra-2 and compared them with those of two other Fos family proteins, c-Fos and Fra-1. Like c-Fos and Fra-1, Fra-2 formed stable heterodimers with c-Jun, JunB or JunD in vitro and all these complexes had specific DNA-binding activity to AP-1-binding sites (AP-1 sites) or related sequences. When transiently introduced into a mouse embryonic carcinoma cell line, F9, with reporter genes containing the AP-1 site from the collagenase gene, fra-2 plus c-jun suppressed the transactivation by c-jun alone. This property of Fra-2 is in clear contrast to that of c-Fos, which stimulates the transcriptional activity of c-Jun by forming a stable heterodimer. Analysis of chimeric proteins between c-Fos and Fra-2 indicated that this difference is mainly attributable to their C terminal-half regions. Interestingly, this suppressive effect of Fra-2 was not observed in the combination with JunD: fra-2 plus junD, like c-fos plus junD, had higher transcriptional activity than junD alone. Fra-1 showed essentially the same transcriptional regulatory properties as Fra-2. These differential properties greatly expand the potential range of regulatory functions of the Fos family proteins.

書籍

  • 分子生物学者のためのバイオインフォマティクス入門
    共立出版 2001年
  • Tracing Biological Evolution in Protein and Gene Structures.
    Takashi Gojobori, Toshinori Endo, Kazuho Ikeo (担当:共著範囲:Mitiko Go and Paul Schimmel eds. pp. 249-260)
    Elsevier Science B. V., Amsterdam. 1995年

講演・口頭発表等

  • Evolutionary analysis of Mollicutes based on standardized phylogenetic tree  [通常講演]
    遠藤 俊徳
    2013年10月 シンポジウム・ワークショップパネル(指名)

その他活動・業績

  • Yutaka Satou, Katsuhiko Mineta, Michio Ogasawara, Yasunori Sasakura, Eiichi Shoguchi, Keisuke Ueno, Lixy Yamada, Jun Matsumoto, Jessica Wasserscheid, Ken Dewar, Graham B. Wiley, Simone L. Macmil, Bruce A. Roe, Robert W. Zeller, Kenneth E. M. Hastings, Patrick Lemaire, Erika Lindquist, Toshinori Endo, Kohji Hotta, Kazuo Inaba GENOME BIOLOGY 9 (10) R152 2008年 [査読無し][通常論文]
     
    Background: The draft genome sequence of the ascidian Ciona intestinalis, along with associated gene models, has been a valuable research resource. However, recently accumulated expressed sequence tag (EST)/cDNA data have revealed numerous inconsistencies with the gene models due in part to intrinsic limitations in gene prediction programs and in part to the fragmented nature of the assembly. Results: We have prepared a less-fragmented assembly on the basis of scaffold-joining guided by paired-end EST and bacterial artificial chromosome (BAC) sequences, and BAC chromosomal in situ hybridization data. The new assembly (115.2 Mb) is similar in length to the initial assembly (116.7 Mb) but contains 1,272 (approximately 50%) fewer scaffolds. The largest scaffold in the new assembly incorporates 95 initial-assembly scaffolds. In conjunction with the new assembly, we have prepared a greatly improved global gene model set strictly correlated with the extensive currently available EST data. The total gene number (15,254) is similar to that of the initial set (15,582), but the new set includes 3,330 models at genomic sites where none were present in the initial set, and 1,779 models that represent fusions of multiple previously incomplete models. In approximately half, 5'-ends were precisely mapped using 5'-full-length ESTs, an important refinement even in otherwise unchanged models. Conclusion: Using these new resources, we identify a population of non-canonical (non-GT-AG) introns and also find that approximately 20% of Ciona genes reside in operons and that operons contain a high proportion of single-exon genes. Thus, the present dataset provides an opportunity to analyze the Ciona genome much more precisely than ever.

共同研究・競争的資金等の研究課題

  • GPGPUによる高速マルチプルアラインメント法の開発
    研究期間 : 2008年
  • Development of GPGPU based multiple sequence alignment method
    研究期間 : 2008年
  • オーファンエンザイム情報に基づく遺伝子機能予測
    ゲノム関係研究
    研究期間 : 2005年
  • Prediction of the function of hypothetical genes based on the information of orphan enzymes
    0104 (Japanese Only)
    研究期間 : 2005年

教育活動情報

主要な担当授業

  • 情報生物学特論
    開講年度 : 2019年
    課程区分 : 修士課程
    開講学部 : 情報科学研究科
    キーワード : バイオインフォマティクス、ゲノムとプロテオーム、分子進化、遺伝子発現、生物学データベース
  • 情報生物学特論
    開講年度 : 2019年
    課程区分 : 修士課程
    開講学部 : 情報科学院
    キーワード : バイオインフォマティクス、ゲノムとプロテオーム、分子進化、遺伝子発現、生物学データベース
  • バイオエンジニアリング特論
    開講年度 : 2019年
    課程区分 : 修士課程
    開講学部 : 情報科学研究科
    キーワード : 遺伝情報, genetic information, バイオインフォマティクス, bioinformatics, イメージング, imaging, 生体医工学, biomedical engineering, 細胞力学, cell mechanics
  • 情報生物学特論
    開講年度 : 2019年
    課程区分 : 博士後期課程
    開講学部 : 情報科学研究科
    キーワード : バイオインフォマティクス、ゲノムとプロテオーム、分子進化、遺伝子発現、生物学データベース
  • 情報生物学特論
    開講年度 : 2019年
    課程区分 : 博士後期課程
    開講学部 : 情報科学院
    キーワード : バイオインフォマティクス、ゲノムとプロテオーム、分子進化、遺伝子発現、生物学データベース
  • バイオエンジニアリング特論
    開講年度 : 2019年
    課程区分 : 博士後期課程
    開講学部 : 情報科学研究科
    キーワード : 遺伝情報, genetic information, バイオインフォマティクス, bioinformatics, イメージング, imaging, 生体医工学, biomedical engineering, 細胞力学, cell mechanics
  • 生物学Ⅰ
    開講年度 : 2019年
    課程区分 : 学士課程
    開講学部 : 全学教育
    キーワード : 生体高分子,細胞の構造と機能,エネルギー代謝,細胞の成長と分裂,遺伝現象と遺伝子発現制御
  • 細胞生物学
    開講年度 : 2019年
    課程区分 : 学士課程
    開講学部 : 工学部
    キーワード : 細胞の構造と機能、エネルギー代謝、細胞内情報伝達、細胞分裂と分化
  • 情報エレクトロニクス概論
    開講年度 : 2019年
    課程区分 : 学士課程
    開講学部 : 工学部
    キーワード : 情報理論, 計算機ハードウエア, 電子デバイス, 生体情報, 生命科学, 電子回路, 通信, メディア, ネットワーク, 電気回路, 制御工学

大学運営

委員歴

  • 2016年09月 - 現在   日本バイオインフォマティクス学会   幹事
  • 2011年04月 - 現在   日本情報処理学会   バイオ情報学研究会論文誌編集委員会
  • 2011年04月 - 現在   日本情報処理学会   バイオ情報学研究会運営委員会
  • 2009年 - 現在   日本遺伝学会   幹事   日本遺伝学会
  • 2003年 - 現在   生き物文化誌学会   常任理事   生き物文化誌学会
  • 2016年10月 - 2017年12月   第6回生命医薬情報学連合大会   大会長
  • 2011年   Society for Molecular Biology and Evolution   年会準備委員   Society for Molecular Biology and Evolution
  • 2008年 - 2010年   日本遺伝学会   第82回大会実行委員   日本遺伝学会
  • 2005年 - 2007年   日本バイオインフォマティクス学会   幹事   日本バイオインフォマティクス学会
  • 2005年 - 2007年   日本バイオインフォマティクス学会   北海道地域部会長   日本バイオインフォマティクス学会


Copyright © MEDIA FUSION Co.,Ltd. All rights reserved.