DDBJ Report 2011

Report for DNA data committee 2012-03-08 at ROIS central office in Tokyo
K.Okubo
Attendant Board Members:
Observers:

=the 24th International Collaborators Meeting=
Summary by Jun Mashima

The International Nucleotide Sequence Database Collaboration (INSDC) members, DDBJ, EBI and NCBI, hold the international collaborators meeting every year. In 2011, the meeting was hosted by DDBJ and held on 23-27 May in Osaka to avoid the effects of the power shortage. Despite the sudden change of venue, five GenBank staff members and six EBI staff members attended. The outcomes of the meeting are summarized below.

The Items; Discussed and To Be Studied
http://farm8.staticflickr.com/7124/7049019407_328c727291_z.jpg
 * This year we went into detail on "WHO IS USING DDBJ by the way?"

NCBI continues SRA and Trace Archive repositories after October 1, 2011.
Recently, NCBI announced that due to budget constraints, it would be discontinuing its SRA and Trace Archive repositories for high-throughput sequence data. However, NIH has since committed interim funding for SRA in its current form until October 1, 2011. In addition, NCBI has been working with staff from other NIH Institutes and NIH grantees to develop an approach to continue archiving a widely used subset of next generation sequencing data after October 1, 2011. In addition, NCBI will continue to provide access to existing SRA and Trace Archive data for the foreseeable future. NCBI is also continuing to discuss with NIH Institutes approaches for handling other next-generation sequencing data associated with specific large-scale studies.

New INSD entity dedicated to "Metadata": BioProject
Since 2005, INSDC has discussed project ID assignment as a flag to specify not only genomic and metagenomic sequencing projects but also many kinds of biological projects with considerable modifications. In 2011, the schema of BioProject was introduced. See also the DDBJ BioProject Database. A BioProject is a collection of biological data related to a single initiative, originating from a single organization or from a consortium. A BioProject record gives users a single place to find links to the diverse data types generated for that project. BioProject accession numbers have the format PRJ[D|E|N][A-Z]+integer, where D=DDBJ, E=EBI and N=NCBI; for example: PRJNA38683
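The accession format described above can be checked mechanically. The following is a minimal sketch, assuming the format means "PRJ", one archive letter (D, E or N), one further capital letter, and then an integer, as in the example PRJNA38683. The function name `parse_bioproject` and the returned field names are hypothetical and not part of any INSDC tool.

```python
import re

# Pattern inferred from the description "PRJ[D|E|N][A-Z]+integer".
# Assumption: exactly one capital letter follows the archive letter.
BIOPROJECT_RE = re.compile(r"PRJ([DEN])([A-Z])(\d+)")

# Archive letters as listed in the text.
ARCHIVES = {"D": "DDBJ", "E": "EBI", "N": "NCBI"}


def parse_bioproject(accession):
    """Split a BioProject accession into its parts, or return None if it does not fit."""
    match = BIOPROJECT_RE.fullmatch(accession)
    if match is None:
        return None
    archive_letter, second_letter, number = match.groups()
    return {
        "archive": ARCHIVES[archive_letter],
        "letter": second_letter,
        "number": int(number),
    }


if __name__ == "__main__":
    print(parse_bioproject("PRJNA38683"))
```

Under this reading, the example accession from the text resolves to the NCBI archive, and a string with any other third letter is rejected.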

Agreement on Data exchange process and feature usage
See: http://www.ddbj.nig.ac.jp/insdc/icm2011-e.html

History and background

 * The three banks meet every year to accommodate changes and progress in molecular biology (International Collaborative Meeting, ICM). Past outcomes include alterations in description rules and the creation of new divisions (a collection of retrofit examples), the Feature Table definition, and the transition of DDBJ features.

<EVENT> 23rd ICM <DATE> 2010-5-19/2010-5-21 <PLACE> Hinxton <ATTENDANT> Nakamura,Y.; Mashima,J.; Kodama,Y. <NOTE> reflected in FT-doc xx.
<EVENT> 22nd ICM <DATE> 2009-5-12/2009-5-15 <PLACE> Bethesda <ATTENDANT> Okubo,K.; Nakamura,Y.; Tateno,Y.; Mashima,J.; Kodama,Y.
<EVENT> 21st ICM <DATE> 2008-5-20/2008-5-22 <PLACE> Mishima <ATTENDANT> all_DDBJ(Gojobori,T.; Sugawara,H.; Saitou,N.; Tateno,Y.; Ikeo,K.; Fukuchi,S.; Suzuki,Y.; Sumiyama,K. ...)
<EVENT> 20th ICM <DATE> 2007-5-21/2007-5-23 <PLACE> Hinxton <ATTENDANT> Tateno,Y.; Sugawara,H.; Saitou,N.; Ikeo,K.; Suzuki,Y.; Mashima,J.; Aono,H.
<EVENT> 19th ICM <DATE> 2006-5-15/2006-5-17 <PLACE> Bethesda <ATTENDANT> Tateno,Y.; Sugawara,H.; Mashima,J.; Okido,T.
<EVENT> 18th ICM <DATE> 2005-5-15/2005-5-18 <PLACE> Mishima <ATTENDANT> all_DDBJ(Gojobori,T.; Sugawara,H.; Saitou,N.; Okubo,K.; Tateno,Y.; Fukuchi,S.; Barrero,R. ...)
<EVENT> 17th ICM <DATE> 2004-5-17/2004-5-19 <PLACE> Hinxton <ATTENDANT> Tateno,Y.; Sugawara,H.; Fukuchi,S.; Mashima,J.; Kosuge,T.
<EVENT> 16th ICM <DATE> 2003-5-19/2003-5-21 <PLACE> Bethesda <ATTENDANT> Tateno,Y.; Sugawara,H.; Miyazaki,S.; Mashima,J.; Sakai,K.
<EVENT> 15th ICM <DATE> 2002-5-20/2002-5-22 <PLACE> Mishima <ATTENDANT> all_DDBJ

<EVENT> 20th IAC <DATE> 2009-07-01 <PLACE> Polycom from NII <ATTENDANT> Miyano,S.; Sugano,S. <OBSERVER> Gojobori,T.; Okubo,K.; Nakamura,Y.
<EVENT> 19th IAC <DATE> 2008-08-22 <PLACE> Polycom from NII <ATTENDANT> Fujiyama,A.; Nakamura,H.; Sugano,S. <OBSERVER> Gojobori,T.; Sugawara,H.; Tateno,Y.; Ikeo,K.
<EVENT> 18th IAC <DATE> 2007-06-19 <PLACE> Polycom from NII <ATTENDANT> Fujiyama,A.; Sugano,S.; Nakamura,H. <OBSERVER> Gojobori,T.; Sugawara,H.; Tateno,Y.; Saitou,N.; Ikeo,K.
<EVENT> 17th IAC <DATE> 2006-05-18 <PLACE> Bethesda <ATTENDANT> Fujiyama,A.; Sugano,S.; Nakamura,H. <OBSERVER> Gojobori,T.; Sugawara,H.; Tateno,Y.; Saitou,N.; Okubo,K.; Ikeo,K.
<EVENT> 16th IAC <DATE> 2005-05-19/2005-05-20 <PLACE> Mishima <ATTENDANT> Matsubara,K.; Fujiyama,A.; Sugano,S. <OBSERVER> Gojobori,T.; Sugawara,H.; Tateno,Y.; Saitou,N.; Okubo,K.
<EVENT> 15th IAC <DATE> 2004-05-20/2004-05-21 <PLACE> Hinxton <ATTENDANT> Fujiyama,A.; Nakamura,H. <OBSERVER> Gojobori,T.; Sugawara,H.; Tateno,Y.; Okubo,K.; Fukuchi,S.
 * The International Advisory Committee (IAC) is a consulting and reviewing body that consists of scientists appointed by each bank from among its own review board members.

=Growth of DDBJ release file (in bases)=
http://farm6.staticflickr.com/5031/6902927268_14d738a4ef_z.jpg
 * Release files do not contain WGS. The data are in Table 1.

=DNA data submission to DDBJ=
 * We receive 3,000 small and 300 large submissions a year.
 * A submission corresponds to a project, that is, to one reporting publication. Some include a single entry, while others may include a million.
 * We receive 40,000 mails from past submitters and send 5,000 mails a year.
 * See table below.

Submission Trends in published data

 * Analysis by DBCLS cross categorization

Submission trends in data reception log
 * High resolution global submission trend analysis by Kouji Watanabe, Takahiro Kazama.

http://www.youtube.com/watch?v=GfBt3KYINv4

One-by-one Submission (from Web form) to DDBJ

 * Institutional/Geographical, chronological, divisional and size (in entries, in bases/bytes) distribution of submissions

Master-slave type Mass submission to DDBJ (EST,WGS,Complete annotated genomes)

 * Institutional/Geographical, chronological, divisional and size distribution of submissions

Raw data submission (DDBJ short read archive)

 * 1) Institutional/Geographical, chronological, experimental type and size distribution of submissions.

NGS raw data submission (published): data by Eli Kaminuma

institute	published year	bp	Tera-bp
DRA	2007	0	0.000
DRA	2008	0	0.000
DRA	2009	595135975	0.001
DRA	2010	67819543698	0.068
DRA	2011	1605037446684	1.605
ERA	2007	0	0.000
ERA	2008	0	0.000
ERA	2009	798289669634	0.798
ERA	2010	15131061600518	15.131
ERA	2011	29743421617416	29.743
SRA	2007	20304190149	0.020
SRA	2008	2245439186379	2.245
SRA	2009	12289099408838	12.289
SRA	2010	55960822016368	55.961
SRA	2011	105049317018548	105.049
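The Tera-bp column in the table above is the bp column divided by 10^12 and rounded to three decimals. The short sketch below reproduces that conversion for the 2011 rows and, as an illustration only, derives the 2010 to 2011 growth factor for DRA. The variable and function names are hypothetical; the figures are copied from the table.

```python
# Published base pairs per archive, copied from the table above.
BP_2010 = {"DRA": 67819543698, "ERA": 15131061600518, "SRA": 55960822016368}
BP_2011 = {"DRA": 1605037446684, "ERA": 29743421617416, "SRA": 105049317018548}


def to_tera_bp(bp):
    """Convert a base-pair count to tera-base-pairs, rounded as in the table."""
    return round(bp / 1e12, 3)


# Reproduce the Tera-bp column for 2011.
tera_2011 = {archive: to_tera_bp(bp) for archive, bp in BP_2011.items()}

# Year-over-year growth factor for DRA (2011 relative to 2010).
dra_growth = BP_2011["DRA"] / BP_2010["DRA"]

if __name__ == "__main__":
    for archive, value in tera_2011.items():
        print(f"{archive} 2011: {value:.3f} Tera-bp")
    print(f"DRA 2010 -> 2011 growth: x{dra_growth:.1f}")
```

The growth factor is a derived figure that does not appear in the report itself; it is included only to show how the table can be read.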

=Releasing data and Providing Services=
https://sites.google.com/a/g.nig.ac.jp/cib-ddbj-2010nendo-jigyou-houkoku/cib-ddbj2011nian-du-shi-ye?pli=1

Unique monthly visitors to DDBJ web
http://farm8.staticflickr.com/7203/6961871725_43a1dc7bec_n.jpg
 * Each year is shown in a different column, and each service in a different color.
 * The log was analyzed with AWStats, and all robot hits were removed.

http://farm8.staticflickr.com/7210/6795050788_5a2d0d6d21.jpg
 * The same data with rows and columns transposed.
 * Among the popular services, GIB was terminated because it has been replaced by MiGAP and is expensive to maintain.

DDBJ release files: ftp service
Data are from the ftp log, excluding visitors who only browsed the ftp directory.

http://farm8.staticflickr.com/7177/6962080869_9903e3e53a_z.jpg

All ftp downloads including NGS data (DRA)
http://farm8.staticflickr.com/7178/6963496157_cf2205d490.jpg
Log data sorted out by Kouji Watanabe.

Crimson represents intra-campus file transfer. It is getting smaller and is being replaced by outward data flow.

Q (OK): Isn't the traffic too high?
A (kouwatan): The 2 tera for DRA are data that were submitted to DRA (so-called "DRA-derived" data). Public data in SRA as a whole have now reached 270 Tbase (not data size). http://www.ncbi.nlm.nih.gov/Traces/sra/

Even for the DRA operated by DDBJ, I recall that as of last summer the fastq files were close to filling the disks of the BIOS 100 TB MAID, so having 30 TB of fastq downloaded does not seem particularly unusual.

=Public Work flows (Pipelines) on DDBJ computer system-- New --=
Table. DDBJ cluster machine usage
service	cores occupied
BLAST,ClustalW	528
Open system	272
KaminumaPipe	196
MiGAP	56
DDBJ data processing	72
others	108
 * In response to the demand for handling massive data from the bench, DDBJ tested providing pipelines in addition to single-step analysis programs.

Annotation Pipeline for Bacterial Genome (MiGAP)
Created by H. Sugawara and Insilicobiology at DBCLS; service provided by DDBJ computers.
http://farm8.staticflickr.com/7066/6940451121_25a17898b2.jpg
 * Log data provided from Insilicobio and analyzed by OK.
 * Raw data in CSV file [[File:MiGAP LOG aggregated.doc]]

DDBJ Read Annotation pipeline
Created under the grant for Y. Nakamura and designed by E. Kaminuma. Service provided with DDBJ computers.
http://farm8.staticflickr.com/7193/6794175436_6093c68bd7_z.jpg
 * Log provided by Kaminuma and analyzed by OK.
 * Total jobs: 305 (including 29 jobs with estimated size = 0); 1-60523 Mbase?
 * Users (distinct contact mail addresses): 41 (including 7 users with total ES = 0)
 * User domain (different contact domains):
 * Top 12 (in ES) users account for more than 95% of total ES,
 * Top 12 users account for more than 76% of non zero Jobs (210 of 276)
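The job and user figures in the list above can be cross-checked with a few lines of arithmetic. This is only a sketch using the numbers quoted in the bullets; the variable names are hypothetical.

```python
# Figures quoted in the bullets above.
total_jobs = 305
zero_size_jobs = 29      # jobs with estimated size (ES) = 0
top12_nonzero_jobs = 210
total_users = 41
zero_size_users = 7      # users whose total ES = 0

# Jobs and users that actually produced output.
nonzero_jobs = total_jobs - zero_size_jobs
nonzero_users = total_users - zero_size_users

# Share of non-zero jobs run by the top 12 users.
top12_job_share = top12_nonzero_jobs / nonzero_jobs

if __name__ == "__main__":
    print(f"non-zero jobs: {nonzero_jobs}")
    print(f"non-zero users: {nonzero_users}")
    print(f"top 12 share of non-zero jobs: {top12_job_share:.1%}")
```

The result agrees with the "210 of 276" and "more than 76%" statements in the list.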

Values are "Estimated Output Size in Mb" defined by Kaminuma
 * 1) Total Output of DDBJ Read Annotation pipeline in Jan 2011-Jan 2012

=Complete renewal of HW and SW for DDBJ operation 2010-2012=
DDBJ is expected to handle 1000-fold larger data with slightly decreasing funding. To achieve this, machines and software should be replaced with much more cost-effective ones. Moreover, less-used services and services for narrower user groups should be reviewed and, if necessary, suspended to save maintenance effort.
Osamu Ogasawara and T.Takagi with TTF
http://www.youtube.com/watch?v=SadW_jYU17k
QTH5ufBR2co
 * Retired old computer system New computer system 2012

Old system components

 * DB construction and public use
   * SMP server: PRIMEPOWER2500 (28 CPUs)
   * Storage: ETERNUS8000 (125 TB)
   * Blade server: BladeSymphony
 * Similarity search
   * PC cluster: PRIMERGY RX200S3 (66 nodes)
 * Keyword search of DDBJ
   * XML search appliance: ShunsakuEngine (2520 CPUs)
 * Molecular biology applications
   * SMP server: PRIMEPOWER2500 (32 CPUs)
   * PC cluster: PRIMERGY RX200S3 (66 nodes)
 * Back-up
   * Tape archive machine: VD800 (750 TB)

New system components
http://farm8.staticflickr.com/7044/6962864105_9a6fb3bbb5_z.jpg
 * 1) Faceless nodes, Fast I/O Disks and Inexpensive Disks for flexible use.

Comparison of usage between old and new supercomputers

 * old: http://rgm3.lab.nig.ac.jp/Repository2/nig_supercomp0/table1.html (password required)
 * new: http://www.ddbj.nig.ac.jp/system/supercom/supercom-util.html (open to the public)

Comparison of the BLAST performance
=Abolish inefficient and expensive "commercial programs" and full scratch construction with Open Sources=
Osamu Ogasawara and the system migration team (Moriyama, Watanabe, Kawahara, Shiida, Ehara and Yabuta) with aid from RNAi, IMSLab, Inc.


 * Systems re-implemented and in operation
 * 1.	DDBJ Daily update data producing system (Daily Format converting process from G,E to D)
 * 2.	DDBJ Release data producing system (Format converting process from G,E to D)
 * 3.	GetEntry (version management system for all DDBJ records)
 * 4.	ARSA (keyword search system for 0.15 billion DDBJ records)
 * System being ported on an open source DBMS.
 * 1.	TSUNAMI (Data management system for DDBJ data curation process)
 * Systems in process of re-implementation
 * 1.	SAKURA (DDBJ data submission system)
 * 2.	Web services of NCBI BLAST and ClustalW



Open Source Entry Retrieval System for all versions of records
DDBJ GetEntry is a search service that returns DDBJ data records (nucleic acid sequence data and annotation) for given identifiers (called “accession numbers”). The system also manages the update history of the data records.
 * Comparison of version management system GetEntry for whole DDBJ

Open Source Key word search for whole DDBJ

 * Comparison of Key word search for whole DDBJ

= Academic Presentation / Training Course / Visitor Tour / Public Relations =

Planning & Management: Eli Kaminuma, Yasukazu Nakamura

Academic presentation

 * DDBJ presented journal papers, journal articles, and oral/poster presentations to the academic community as follows.

<Event> Journal Paper <Date> 2012-01 <Note> Title= The sequence read archive: explosive growth of sequencing data <Note> Author= Yuichi Kodama, Martin Shumway, Rasko Leinonen <Note> Citation= Nucleic Acids Res. 2012 Jan;40(Database issue):D54-6.

<Event> Journal Paper <Date> 2012-01 <Note> Title= The DNA Data Bank of Japan launches a new resource DDBJ Omics Archive of functional genomics experiments <Note> Author= Yuichi Kodama, Jun Mashima, Eli Kaminuma, Takashi Gojobori, Osamu Ogasawara, Toshihisa Takagi, Kousaku Okubo and Yasukazu Nakamura <Note> Citation= Nucleic Acids Res. 2012 Jan;40(Database issue):D38-42.

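<Event> Journal Document <Date> 2011-09 <Note> Title= DDBJ today: inheritance and transformation <Note> Author= Yasukazu Nakamura, Osamu Ogasawara, Eli Kaminuma, Hideaki Sugawara, Toshihisa Takagi, Kousaku Okubo <Note> Citation= Jikken Igaku (Experimental Medicine) extra issue Vol.29 No.15, "Usable databases and web tools", 有田正規 (ed.)

<Event> Journal Document <Date> 2011-09 <Note> Title= DDBJ nucleotide sequence analysis tools and submission systems <Note> Author= Hideki Nagasaki, Eli Kaminuma <Note> Citation= Jikken Igaku (Experimental Medicine) extra issue Vol.29 No.15, "Usable databases and web tools", 有田正規 (ed.)

<Event> Journal Document <Date> 2011-08 <Note> Title= DDBJ Cloud: sequence archive and cloud-based analysis tools for next-generation sequencers <Note> Author= Eli Kaminuma, Yasukazu Nakamura <Note> Citation= Saibo Kogaku (Cell Technology) August issue, special feature "Mastering next-generation sequencers", 清水厚志, 佐々木貴史 (supervising eds.)

<Event> Journal Document <Date> 2011-03 <Note> Title= Large-scale sequence analysis for new-generation DNA sequencers based on the use of cloud-type computing resources <Note> Author= Eli Kaminuma, Yasukazu Nakamura <Note> Citation= Kagaku to Seibutsu (Chemistry and Biology) March issue, "How to use bioinformatics in plant science", 矢野健太郎 (ed.)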

<Event> Poster Presentation <Date> 2011-12-14 <Place> meeting=The 34th Annual Meeting of the Molecular Biology Society of Japan, Place=Pacifico Yokohama <Note> Title="DDBJ created a new Web-based nucleotide sequence data submission tool, code name: D-easy" <Note> Author= Takehide Kosuge, Jun Mashima, Haru Tsutsui, Mayumi Ejima, Eli Kaminuma, Toshihisa Takagi, Kousaku Okubo, and Yasukazu Nakamura

<Event> Poster Presentation <Date> 2011-12-14 <Place> meeting=The 34th Annual Meeting of the Molecular Biology Society of Japan, Place=Pacifico Yokohama <Note> Title= "New resources of DNA Data Bank of Japan: DDBJ Omics Archive and BioProject" <Note> Author= Yuichi Kodama, Asami Nozaki, Kyung-Bum Lee, Jun Mashima, Eli Kaminuma, Toshihisa Takagi, Kousaku Okubo, Yasukazu Nakamura

<Event> Poster/Oral Presentation <Date> 2011-12-14 <Place> meeting=The 34th Annual Meeting of the Molecular Biology Society of Japan, Place=Pacifico Yokohama <Note> Title = Progress of DDBJ Pipeline, a cloud-based analysis pipeline for new-generation sequencer archive sequences: annotation workflow for de novo assembled sequences <Note> Author = Hideki Nagasaki, Takako Mochizuki, Yuichi Kodama, Satoshi Saruhashi, Toshihisa Takagi, Kousaku Okubo, Eli Kaminuma, Yasukazu Nakamura

<Event> Poster Presentation <Date> 2011-11-28 <Place> meeting=8th Solanaceae and 2nd Cucurbitaceae Joint Conference, Place=Kobe Convention Center <Note> Title= DDBJ Sequence Read Archive and a cloud-computing based analytical pipeline for next-generation sequencing reads <Note> Author= Satoshi Saruhashi, Takako Mochizuki, Hideki Nagasaki, Yuichi Kodama, Asami Nozaki, Kyung-Bum Lee, Toshihisa Okido, Emi Yokoyama, Eli Kaminuma, Hideaki Sugawara, Kousaku Okubo, Toshihisa Takagi, Yasukazu Nakamura

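<Event> Poster Presentation <Date> 2011-10-05 <Place> meeting=NBDC Togo Day Symposium 2011, Place=Mirai CAN Hall, National Museum of Emerging Science and Innovation (Miraikan) <Note> Title = DNA Data Bank of Japan <Note> Author = Yasukazu Nakamura, Osamu Ogasawara, Eli Kaminuma, Toshihisa Takagi, Kousaku Okubo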

<Event> Poster Presentation <Date> 2011-07-10 <Place> meeting=The 2nd Int. Conf. on the Progress of “1000 Plant & Animal Reference Genomes Project”, Place=BGI Shenzhen, China <Note> Title = DDBJ Sequence Read Archive and a cloud-computing based annotation tool for next-generation sequencing data <Note> Author = Hideki Nagasaki, Takako Mochizuki, Yuichi Kodama, Satoshi Saruhashi, Toshihisa Takagi, Kousaku Okubo, Eli Kaminuma, Yasukazu Nakamura

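<Event> Poster Presentation <Date> 2011-05-28 <Place> meeting= NGS Genba no Kai (NGS field meeting), 1st workshop, Place = Atami New Fujiya Hotel <Note> Title = DDBJ Read Annotation Pipeline: a cloud-based analysis pipeline for sequences from new-generation sequencers <Note> Author = Satoshi Saruhashi, Yuichi Kodama, Kyung-Bum Lee, Toshihisa Okido, Emi Yokoyama, Hideki Nagasaki, Takako Mochizuki, Eli Kaminuma, Hideaki Sugawara, Toshihisa Takagi, Kousaku Okubo, Yasukazu Nakamura

<Event> Poster Presentation <Date> 2011-03-20 (canceled) <Place> meeting = The 52nd Annual Meeting of the Japanese Society of Plant Physiologists <Note> Title = DDBJ Read Annotation Pipeline: a cloud-based pipeline for sequences from new-generation sequencers <Note> Author = Hideki Nagasaki, Takako Mochizuki, Eli Kaminuma, 渡邊成樹, Yuichi Kodama, Satoshi Saruhashi, Hideaki Sugawara, Toshihisa Takagi, Kousaku Okubo, Yasukazu Nakamura

<Event> Poster Presentation <Date> 2011-03-14 (canceled) <Place> meeting= The 5th Annual Meeting of the Society of Genome Microbiology, Japan <Note> Title = DNA Data Bank of Japan: data submission and analysis for new-generation sequencers <Note> Author = Satoshi Saruhashi, Yuichi Kodama, Kyung-Bum Lee, Toshihisa Okido, Emi Yokoyama, Hideki Nagasaki, Takako Mochizuki, Eli Kaminuma, Hideaki Sugawara, Toshihisa Takagi, Kousaku Okubo, Yasukazu Nakamura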

Training Course
<Event> original training course <Date> 2012-01-26 <Place> meeting= 25th DDBJing, Place= National Institute of Genetics, Center for Information Biology and DNA Data Bank of Japan, 4F <Attendant> ...
<Note> Title= "Introduction" <Note> Author= Eli Kaminuma
<Note> Title= "DDBJ's support for NGS" <Note> Author= Yasukazu Nakamura
<Note> Title= "De novo assembly of Staphylococcus aureus by the NGS multiplex method" <Note> Author= 佐々木 貴史 (Keio University School of Medicine, special research lecturer)
<Note> Title= "MiGAP: how to use the microbial genome annotation tool" <Note> Author= 大山 彰 (Insilicobiology, Inc., representative director)
<Note> Title= "DBCLS Galaxy: tools, Japanese-language integrated environment, etc." <Note> Author= 山口 敦子 (DBCLS, project associate professor)
<Note> Title= "Introduction to the DDBJ Sequence Read Archive (DRA)" <Note> Author= Yuichi Kodama
<Note> Title= "DDBJ pipeline basic part (de novo assembly)" <Note> Author= Takako Mochizuki
<Note> Title= "DDBJ pipeline advanced part (galaxy: contig annotation workflow)" <Note> Author= Hideki Nagasaki
<Note> Title= "DDBJ pipeline advanced part (galaxy: phylogenetic tree analysis)" <Note> Author= Satoshi Saruhashi
<Note> Title= "Submission of NGS-derived assembled sequences: Mass Submission System (MSS)" <Note> Author= Toshihisa Okido
 * DDBJ gave the following invited lectures on request and held its original training course 'DDBJing' twice a year. The DDBJing course focuses on how to use DDBJ's data submission systems and sequence annotation tools, especially for next-generation sequencing reads.

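<Event> invited lecture <Date> 2011-12-21 <Place> meeting= Grant-in-Aid for Scientific Research on Innovative Areas "Genetic bases for the evolution of complex adaptive traits", 6th informatics information exchange meeting, Place= Faculty of Agriculture, The University of Tokyo <Note> Title= "Introduction to the DDBJ Sequence Read Archive" <Note> Author= Yuichi Kodama

<Event> invited lecture <Date> 2011-11-29 <Place> meeting= Sequencer application technology workshop, 7th session: Roche Diagnostics Genome Sequencer jr, Place= RIKEN Yokohama Institute <Note> Title= "Submitting next-generation sequencer data to DRA (DDBJ Sequence Read Archive)" <Note> Author= Yuichi Kodama

<Event> invited short lecture <Date> 2011-11-18 <Place> meeting= NBRP wheat subcommittee, Place= National Institute of Genetics <Note> Title= "Introduction to DDBJ services" <Note> Author= Eli Kaminuma

<Event> invited lecture <Date> 2011-08-19 <Place> meeting=154th Norin Koryu Center (Agriculture and Forestry Exchange Center) workshop, Place= Norin Koryu Center <Note> Title= "Introduction to the archive DB for next-generation sequencers and the cloud-based data analysis system" <Note> Author= Eli Kaminuma1, Yuichi Kodama1, Satoshi Saruhashi1, Takako Mochizuki1, Hideki Nagasaki1, Toshihisa Takagi1, Kousaku Okubo1,2, Yasukazu Nakamura1

<Event> original training course <Date> 2011-06-30 <Place> meeting= 24th DDBJing, Place= Database Center for Life Science (DBCLS) <Attendant> ...
<Note> Title= "Introduction to DDBJ, data acceptance, etc." <Note> Author= Yasukazu Nakamura
<Note> Title= "On base-call accuracy and assembly accuracy in NGS" <Note> Author= 新井 理 (BITS Co., Ltd., Mishima Laboratory, director)
<Note> Title= "MiGAP: how to use the microbial genome annotation tool" <Note> Author= 大山 彰 (Insilicobiology, Inc., representative director)
<Note> Title= "Data submission to the DDBJ Sequence Read Archive (DRA)" <Note> Author= Yuichi Kodama
<Note> Title= "Pipeline basics (assembly and mapping)" <Note> Author= Takako Mochizuki
<Note> Title= "Pipeline advanced (galaxy and examples of genome and SNP analysis)" <Note> Author= Hideki Nagasaki
<Note> Title= "Submission of NGS-derived assembled sequences: Mass Submission System (MSS)" <Note> Author= Toshihisa Okido

<Event> invited lecture <Date> 2011-01-28 <Place> meeting=Tohoku University Global COE, 7th Network Medicine special lecture, Place=Tohoku University Graduate School of Medicine <Note> Title="How to use cloud-type computing resources for new-generation sequencer sequence analysis" <Note> Author = Eli Kaminuma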

Visitor Tour / Guidance

 * All tour requests from outside visitors were accepted in DDBJ.

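Visitor list
<Event> SiteVisit2011nn <Date> 2012-02-27 <Attendant> Kaminuma <Note> visitor=Shizuoka Prefecture, 3 prefectural government staff
<Event> SiteVisit2011nn <Date> 2011-11-17 <Attendant> Kaminuma <Note> visitor=Mishima City Yamada Junior High School, "workplace experience learning", 2 persons
<Event> SiteVisit2011nn <Date> 2011-08-16 <Attendant> Nakamura, Kaminuma <Note> visitor=Kanagawa Prefectural Hakuyo High School, 5 persons
<Event> SiteVisit2011nn <Date> 2011-08-02 <Attendant> Nakamura, Kaminuma <Note> visitor=Shizuoka Prefecture, director in charge of medical care and hygiene, 1 person
<Event> SiteVisit2011nn <Date> 2011-05-10 <Attendant> Kaminuma <Note> visitor=Kanagawa Prefectural Yokosuka High School, second-year students, 11 persons
<Event> SiteVisit2011nn <Date> 2011-03-07 <Attendant> Okubo <Note> visitor=Ehime University, undergraduate students (several)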

Public Relations ( Release news / Mail magazine / Q&A )
DDBJ provides data bank information to users via three channels: release news, mail magazine, and Questions & Answers (Q&A).

Data release news
Hot topics from the latest DDBJ entries are selected by Dr. Mashima (the head annotator), and the information team generates release documents and posts them to several media: the DDBJ top page, Hot Topics, and Twitter (started 2010.04; 183 followers and 11 lists).
<Event> DDBJ Hot Topics <Date> 2012-02-02 <Note> Release of 58,584 EST entries of eucalyptus (Eucalyptus camaldulensis)
<Event> DDBJ Hot Topics <Date> 2012-01-25 <Note> Release of eucalyptus (Eucalyptus camaldulensis) genome sequences: CON 28,672 entries, WGS 274,001 entries, raw data DRA000466, DRA000467
<Event> DDBJ Hot Topics <Date> 2012-01-11 <Note> Release of 11,858 entries of pig (Sus scrofa) full length enriched cDNA sequences
<Event> DDBJ Hot Topics <Date> 2011-12-28 <Note> Release of rice cv. Hitomebore (Oryza sativa Japonica Group cv. Hitomebore) genome sequences: WGS 64,745 entries and scaffold CON 12 entries
<Event> DDBJ Hot Topics <Date> 2011-12-06 <Note> Release of human (Homo sapiens) MGA data
<Event> DDBJ Hot Topics <Date> 2011-10-24 <Note> Release of Chinese liver fluke (Clonorchis sinensis) WGS 6,190 entries and scaffold CON 2,555 entries
<Event> DDBJ Hot Topics <Date> 2011-09-26 <Note> Release of Chinese liver fluke (Clonorchis sinensis) WGS 60,778 entries and scaffold CON 16,212 entries
<Event> DDBJ Hot Topics <Date> 2011-09-16 <Note> New release of silkworm (Bombyx mori) EST and full length cDNA data
<Event> DDBJ Hot Topics <Date> 2011-09-13 <Note> Release of sake yeast (Saccharomyces cerevisiae Kyokai no. 7) WGS 705 entries and scaffold CON 14 entries
<Event> DDBJ Hot Topics <Date> 2011-08-03 <Note> New release of 52,250 EST entries of the ascidian Halocynthia roretzi
<Event> DDBJ Hot Topics <Date> 2011-07-29 <Note> New release of the coral Acropora digitifera: WGS 53,640 entries, scaffold CON 4,171 entries
<Event> DDBJ Hot Topics <Date> 2011-07-01 <Note> Release of Botryococcus braunii TSA 56,465 entries and EST 9,345 entries
<Event> DDBJ Hot Topics <Date> 2011-06-09 <Note> New release of 266,600 EST entries of tammar wallaby (Macropus eugenii)
<Event> DDBJ Hot Topics <Date> 2011-05-11 <Note> Release of the ascidian Ciona intestinalis: TPA-WGS 6,374 entries, TPA-scaffold CON 1,272 entries
<Event> DDBJ Hot Topics <Date> 2011-04-05 <Note> New release of 8,583 WGS entries of barley cv. Haruna Nijo (Hordeum vulgare subsp. vulgare cv. Haruna Nijo)
<Event> DDBJ Hot Topics <Date> 2011-03-25 <Note> New release of 437,642 GSS entries of African rice (Oryza glaberrima)
<Event> DDBJ Hot Topics <Date> 2011-03-23 <Note> New release of 23,614 full length cDNA entries of barley (Hordeum vulgare subsp. vulgare)
<Event> DDBJ Hot Topics <Date> 2011-02-25 <Note> New release of 59,716 GSS entries of rice (Oryza sativa Japonica Group)
<Event> DDBJ Hot Topics <Date> 2011-02-07 <Note> New release of 131,507 GSS entries of mouse (Mus musculus domesticus)
<Event> DDBJ Hot Topics <Date> 2011-01-11 <Note> New release of whole-genome sequence data of the biofuel crop jatropha (Jatropha curcas): genome BAC clones 17 entries, WGS 150,417 entries, raw data DRA000305, DRA000306, and cDNA raw data DRA000303, DRA000304
<Event> DDBJ Hot Topics <Date> 2011-01-11 <Note> New release of 52,745 EST entries of Chinese liver fluke (Clonorchis sinensis)

Jun Mashima (construction team), Junko Kohira (information team), Kimiko Suzuki (information team), Sachiko Nagira (information team), Fumie Hirata (information team)

Mail magazine
The information team introduces DDBJ activities to users through a digital mail magazine. The DDBJ mail magazine is sent to the all-nig mailing list (around 630 persons) and to DDBJ's own mailing list (around 3,000 persons).

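<Event> DDBJ mail magazine No.68 <Date> 2012-02-03 <Note> (Important) Service interruptions and changes due to the supercomputer system migration / DDBJ release 88.0 and DAD release 58.0 completed / Materials from the "25th DDBJing course in Mishima" available for download / Release of large-scale data
<Event> DDBJ mail magazine No.67 <Date> 2011-12-27 <Note> DDBJ year-end and New Year holiday notice / (2012/1/12-18) Scheduled temporary suspension of DDBJ public services / Papers on DDBJ published in Nucleic Acids Research / Revision of the DDBJ/EMBL/GenBank Feature Table Definition / Release of human (Homo sapiens) MGA data / "Komatta de Q" No. 7
<Event> DDBJ mail magazine No.66 <Date> 2011-11-30 <Note> Announcement of the "25th DDBJing course in Mishima" / Booth exhibition and poster presentations at the 34th Annual Meeting of the Molecular Biology Society of Japan / DDBJ year-end and New Year holiday notice / (12/6) Temporary network outage at the National Institute of Genetics and DDBJ / "SAKURA de Q" No. 10
<Event> DDBJ mail magazine No.65 <Date> 2011-10-27 <Note> Announcement of the "25th DDBJing course in Mishima" / (11/11-14) Suspension of public services due to a power outage at the National Institute of Genetics / DDBJ BioProject service launched / DDBJ nucleotide sequence analysis tools (part 2) / Release of large-scale data / GIB service resumed / Introducing the work of DDBJ annotators: 6. From the development unit
<Event> DDBJ mail magazine No.64 <Date> 2011-09-29 <Note> DDBJ release 87.0 and DAD release 57.0 completed / Report on the 24th International Collaborators Meeting / DDBJ BioProject website opened / Introduction of the book "Usable databases and web tools" / DDBJ nucleotide sequence analysis tools (part 1) / FAX number entry made mandatory for submissions via SAKURA / Release of large-scale data / "SAKURA de Q" No. 9 / Introducing the work of DDBJ annotators: 5. DDBJ Sequence Read Archive
<Event> DDBJ mail magazine No.63 <Date> 2011-08-04 <Note> Introduction of a CMS for the DDBJ homepage / "24th DDBJing course in Tokyo" finished / Release of large-scale data / "Komatta de Q" No. 6 / Introducing the work of DDBJ annotators: 4. Updates (part 2)
<Event> DDBJ mail magazine No.62 <Date> 2011-06-28 <Note> DDBJ release 86.0 and DAD release 56.0 completed / Release of patent nucleotide sequence data from the Japan Patent Office / Call for FY2011 DDBJ Open System projects / International Collaborators Meeting 2011 (ICM2011) was held / Announcement of The Bioinformatics Roadshow / Release of large-scale data / Temporary suspension of ARSA / Introducing the work of DDBJ annotators: 4. Updates (part 1)
<Event> DDBJ mail magazine No.61 <Date> 2011-05-31 <Note> "DDBJing course in Tokyo" to be held / ARSA service resumed / Release of large-scale data / "Komatta de Q" No. 5 / Introducing the work of DDBJ annotators: 3. Using the Mass Submission System (MSS) (part 2)
<Event> DDBJ mail magazine No.60 <Date> 2011-04-27 <Note> BLAST and ClustalW services resumed / Release of large-scale data / "Komatta de Q" No. 4 / Introducing the work of DDBJ annotators: 3. Using the Mass Submission System (MSS) (part 1)
<Event> DDBJ mail magazine No.59 <Date> 2011-03-31 <Note> DDBJ release 85.0 and DAD release 55.0 completed / Patent publication number and sequence number added to the DEFINITION line / FY2011 National Institute of Genetics open house canceled / Release of large-scale data / "Komatta de Q" No. 3 / Introducing the work of DDBJ annotators: 2. Submission via SAKURA (part 2)
<Event> DDBJ mail magazine extra issue <Date> 2011-03-14 <Note> Notice of service suspension in response to the rolling blackouts by Tokyo Electric Power Company
<Event> DDBJ mail magazine No.58 <Date> 2011-03-01 <Note> DDBJ release 84.1 and DAD release 54.1 released / Revision of the DDBJ/EMBL/GenBank Feature Table Definition / DDBJ will continue Sequence Raw Data Archiving / Release of large-scale data / Guidance on renewing applications to use the National Institute of Genetics supercomputer (supernig) / "SAKURA de Q" No. 9 / Introducing the work of DDBJ annotators: 2. Submission via SAKURA (part 1)
<Event> DDBJ mail magazine No.57 <Date> 2011-02-01 <Note> Paper on DDBJ published in Nucleic Acids Research / DAD release 54.0 completed / Change in the structure of the dra directory on FTP / "DDBJing course (23) & PDBj course in Nagahama" finished / Release of large-scale data / "A request to all users!" Part 4: Tips for when you have too much data to submit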

Junko Kohira (information team), Kimiko Suzuki (information team), Sachiko Nagira (information team), Fumie Hirata (information team)

Questions & Answers

 * DDBJ accepts users' questions through the DDBJ homepage and, since November 2010, through the DDBJ tag of the DBCLS "LifeScience QA" site. Most answers are written by annotators and faculty members and are posted to the appropriate site by the information team.


 * 1) DDBJ QA site http://www.ddbj.nig.ac.jp/addresses-j.html
 * 2) Life science QA


 * Question on DDBJ tag http://qa.lifesciencedb.jp/tags/ddbj/
 * Question on DRA tag http://qa.lifesciencedb.jp/tags/dra/

Jun Mashima (construction team), Sachiko Nagira (information team)

= Services in disaster and accidents=
 * As a countermeasure to the power outage problem after March 2011, emergency services were set up on a rental server (Sakura Rental Server). The following two services have been running on the rental server because of the 2012 NIG supercomputer replacement (from Feb 27, 2012).


 * 1) http://ddbj.sakura.ne.jp/ (DDBJ Homepage)
 * 2) http://v-sakura.ddbj.nig.ac.jp/ (SAKURA data submission tool for the INSDC database)

All faculty members
All Takagi-TF members
Related system engineers
Takehide Kosuge (construction team)
Kimiko Suzuki (information team)
http://farm7.static.flickr.com/6204/6146135415_5f3b325d97.jpg
http://farm7.static.flickr.com/6065/6146683186_de682c729f.jpg
 * <DDBJevent>Accident<Date>2011-03-17<Note>after the earthquake, service shift under rolling blackouts

http://farm7.static.flickr.com/6174/6146137229_56fdca0cab.jpg
 * <DDBJevent>Accident<Date>2011-06-27<Note>Precise temperature measurement for power saving in the computer building. Dr. Yamada of the Takagi lab (seen here) led the NIG-wide temperature project.

=Statistics in Tables=
Accumulative contribution to INSDC (Bases)
 * Table1
Rel.(YYYY/MM)	DDBJ	JPO	KIPO	EMBL	EPO	GenBank	USPTO	total (INSD)
Re88 (2011/12)	12,249,278,699	3,895,623,710	93,982,299	18,479,528,264	3,422,717,493	92,779,951,222	4,035,027,362	134,956,109,049
Re84 (2010/12)	11,048,142,428	3,123,592,651	93,982,299	16,853,062,085	3,113,242,691	84,041,012,627	2,646,101,925	120,919,136,706
Re80 (2009/12)	10,247,527,187	1,333,894,516	93,982,299	15,338,392,406	2,420,290,538	77,881,426,775	2,321,348,531	109,636,862,252
Re76 (2008/12)	9,905,388,043	984,443,076	93,982,299	13,004,175,237	1,405,240,475	71,978,805,767	1,369,873,549	98,741,908,446
Re72 (2007/12)	8,566,994,109	636,401,133		10,496,155,273	1,258,532,107	60,928,688,461	705,474,404	82,592,245,487
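The "total (INSD)" column of Table1 is the sum of the seven contributor columns. As a small consistency check, the sketch below re-adds the Re88 (2011/12) row and, for illustration, derives the share of bases contributed under the DDBJ column alone. The dictionary and variable names are hypothetical; the numbers are copied from the table.

```python
# Re88 (2011/12) row of Table1, in bases.
RE88 = {
    "DDBJ": 12_249_278_699,
    "JPO": 3_895_623_710,
    "KIPO": 93_982_299,
    "EMBL": 18_479_528_264,
    "EPO": 3_422_717_493,
    "GenBank": 92_779_951_222,
    "USPTO": 4_035_027_362,
}
REPORTED_TOTAL = 134_956_109_049  # "total (INSD)" column for Re88

# Re-add the contributor columns and compare with the reported total.
computed_total = sum(RE88.values())
totals_match = computed_total == REPORTED_TOTAL

# Share of the DDBJ column in the INSD total, in percent.
ddbj_share = 100 * RE88["DDBJ"] / REPORTED_TOTAL

if __name__ == "__main__":
    print(f"computed total: {computed_total:,}")
    print(f"matches reported total: {totals_match}")
    print(f"DDBJ share: {ddbj_share:.1f}%")
```

The same check can be repeated for the other release rows by swapping in their values.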

Division	R72(2007/12)	R76(2008/12)	R80(2009/12)	R84(2010/12)	R88(2011/12)
Human	424,419	467,117	489,329	527,222	549,320
Primates	94,996	67,624	77,599	84,108	100,839
Mammal	152,550	173,754	217,485	248,889	296,080
Rodent	311,361	357,320	369,592	411,543	428,928
Vertebrates	402,616	512,557	636,361	759,643	901,031
Invertebrates	536,502	714,854	907,262	1,277,072	1,705,900
Plant	963,099	1,329,785	1,603,115	1,872,960	2,267,506
Bacteria	347,693	422,583	530,723	660,618	766,137
Viral	504,218	622,752	771,327	916,407	1,097,112
Phage	3,551	4,079	5,197	5,631	6,365
Environmental	665,658	938,965	1,868,959	2,934,874	3,973,175
Patent	4,524,074	7,003,597	11,655,001	18,264,606	23,134,648
Synthetic	54,238	76,638	90,452	95,427	121,592
Contig	4,232,743	5,737,343	5,865,342	6,240,252	6,901,504
EST	47,183,975	58,896,005	63,788,455	67,359,510	71,312,541
GSS	21,292,899	24,659,950	27,147,321	29,744,074	32,874,011
HTC	493,673	531,468	551,354	568,039	535,729
HTG	116,594	138,214	143,707	145,110	145,891
STS	931,704	1,299,644	1,310,769	1,318,583	1,322,165
TSA	0	3,226	149,952	1,411,098	4,322,705
UNA	278	277	290	288	290
total	83,236,841	103,957,752	118,179,592	134,845,954	152,763,469
 * Table2 Growth of DDBJ release by division (in entries)

Division	R72(2007/12)	R76(2008/12)	R80(2009/12)	R84(2010/12)	R88(2011/12)
Human	4,387,843,677	4,516,579,802	4,623,225,140	4,746,436,836	4,871,171,790
Primates	1,990,419,260	1,123,094,304	1,169,964,090	1,241,619,169	1,290,713,207
Mammal	353,036,144	456,968,327	601,162,981	704,659,167	827,310,516
Rodent	3,093,342,150	4,153,109,888	4,217,777,026	4,332,325,942	4,415,260,756
Vertebrates	1,977,955,182	2,196,293,326	2,409,746,959	2,568,525,262	2,736,438,170
Invertebrates	1,093,862,834	1,240,076,508	1,806,652,983	2,094,439,957	2,490,017,114
Plant	3,134,002,026	3,702,771,027	3,787,572,599	4,174,846,428	5,552,139,564
Bacteria	2,628,122,768	3,349,467,931	4,416,647,255	5,659,727,311	7,342,956,895
Viral	510,427,110	654,659,102	835,382,245	1,018,635,428	1,252,521,302
Phage	26,106,203	32,444,882	39,126,337	48,751,320	69,569,157
Environmental	451,585,697	716,463,746	1,279,684,813	1,940,793,604	2,662,200,445
Patent	2,600,411,410	3,857,584,692	6,169,519,650	8,976,923,332	11,447,354,630
Synthetic	75,883,002	111,877,253	136,220,312	147,012,513	922,229,249
Contig	0	0	0	0	0
EST	25,954,056,962	32,295,799,302	35,107,058,407	37,288,737,529	39,638,590,086
GSS	13,802,470,896	15,957,796,257	17,649,179,822	19,623,876,771	21,009,093,483
HTC	567,254,781	607,907,061	638,413,572	660,766,470	611,638,933
HTG	19,420,498,172	23,142,053,901	24,069,871,416	24,318,568,498	24,358,635,476
STS	524,486,630	623,753,388	629,618,960	633,994,846	635,972,107
TSA	0	2,730,222	49,551,403	737,633,926	2,821,816,096
UNA	480,583	477,527	486,282	484,315	480,073
total	82,592,245,487	98,741,908,446	109,636,862,252	120,918,758,624	134,956,109,049
 * Table 3. Growth of DDBJ release by division (in bases)

R72(2007/12)	R76(2008/12)	R80(2009/12)	R84(2010/12)	R88(2011/12) Human		4,387,843,677 	4,516,579,802 	4,623,225,140 	4,746,436,836 	4,871,171,790 Primates	1,990,419,260 	1,123,094,304 	1,169,964,090 	1,241,619,169 	1,290,713,207 Mammal		353,036,144 	456,968,327 	601,162,981 	704,659,167 	827,310,516 Rodent		3,093,342,150 	4,153,109,888 	4,217,777,026 	4,332,325,942 	4,415,260,756 Vertebrates	1,977,955,182 	2,196,293,326 	2,409,746,959 	2,568,525,262 	2,736,438,170 Invertebrates	1,093,862,834 	1,240,076,508 	1,806,652,983 	2,094,439,957 	2,490,017,114 Plant		3,134,002,026 	3,702,771,027 	3,787,572,599 	4,174,846,428 	5,552,139,564 Bacteria	2,628,122,768 	3,349,467,931 	4,416,647,255 	5,659,727,311 	7,342,956,895 Viral		510,427,110 	654,659,102 	835,382,245 	1,018,635,428 	1,252,521,302 Phage		26,106,203 	32,444,882 	39,126,337 	48,751,320 	69,569,157 Environmental	451,585,697 	716,463,746 	1,279,684,813 	1,940,793,604 	2,662,200,445 Patent		2,600,411,410 	3,857,584,692 	6,169,519,650 	8,976,923,332 	11,447,354,630 Synthetic	75,883,002 	111,877,253 	136,220,312 	147,012,513 	922,229,249 Contig		0 		0 		0 		0 		0 EST		25,954,056,962 	32,295,799,302 	35,107,058,407 	37,288,737,529 	39,638,590,086 GSS		13,802,470,896 	15,957,796,257 	17,649,179,822 	19,623,876,771 	21,009,093,483 HTC		567,254,781 	607,907,061 	638,413,572 	660,766,470 	611,638,933 HTG		19,420,498,172 	23,142,053,901 	24,069,871,416 	24,318,568,498 	24,358,635,476 STS		524,486,630 	623,753,388 	629,618,960 	633,994,846 	635,972,107 TSA		0 		2,730,222 	49,551,403 	737,633,926 	2,821,816,096 UNA		480,583 	477,527 	486,282 	484,315 	480,073 total		82,592,245,487 	98,741,908,446 	109,636,862,252 120,918,758,624 134,956,109,049
 * Table 4. Growth of DDBJ release by division (in bytes)

Month	Web-form	master-slave ftp
Dec-11	2,161	26
Nov-11	2,303	26
Oct-11	2,536	32
Sep-11	2,297	22
Aug-11	2,889	26
Jul-11	2,473	28
Jun-11	2,476	39
May-11	3,036	29
Apr-11	2,732	34
Mar-11	2,651	18
Feb-11	2,351	27
Jan-11	2,009	26
Total	29,914	333
 * Table 5. Submission event (projects) to DDBJ in 2011

#	DDBJ service	2007	2008	2009	2010	2011	start counting	Service ended
1	DDBJ-HP	180,533	193,661	193,293	206,388	199,711
2	getentry	83,437	72,180	71,517	75,868	67,662
3	ARSA	8,702	22,424	36,060	34,734	22,892
3	BLAST	39,849	54,758	54,738	53,616	50,510	2007/2～
4	ClustalW	28,929	39,109	39,577	41,616	41,491
5	SAKURA	15,555	14,864	12,279	11,383	12,388
6	TXSearch	38,834	35,819	29,606	24,614	26,023
7	VecScreen	1,273	1,657	1,647	1,924	2,331
8	Anonymous FTP	1,333	4,011	3,928	5,159	12,260
9	DRA & DTA			1,558	6,846	23,855	2009/3～
10	GIB	61,865	60,727	49,875	45,150	15,981
11	GIB-V			3,465	4,569	4,395	2009/3～
12	GTPS			5,713	23,791	23,226	2009/3～
13	H-Inv	4,317	1,686	1,932				2009/12/31
14	H-Inv Mirror	7,631	17,819	8,311			2007/6～	2009/12/31
15	SRS	36,661	26,094					2008/12/26
16	FASTA	14,101	17,825	17,688	4,363			2010/3/31
17	PSI-BLAST	4,329	6,867	6,556	1,487			2010/3/31
18	SSEARCH	5,031	6,618	6,580	1,462			2010/3/31
19	HMMPFAM	2,854	3,168	2,888	708			2010/3/31
20	Sqmatch	1,804	718					2008/11/14
21	PDB Retriever	1,870	3,872					2008/11/14
22	LIBRA	4,268	5,164					2008/11/14
23	Lib score	183	231					2008/11/14
 * Table 4.  Annual Sum of Monthly unique visitors

User category	1 file	2-9 files	10-99 files	>=100 files	uniq address	total files
DBCLS	1	0	0	7	8	48483
NIG (MextProject)	26	28	10	13	77	30145
RIKEN	3	0	1	1	5	2059
Japan	27	10	6	2	45	2547
Non Japan EDU	27	10	3	0	40	196
Japan Patent Office	0	0	0	2	2	1385
MITI (NITE)	0	2	2	2	6	369
Ministry Agri	2	2	1	3	8	2004
Non Japan Gov	0	1	0	0	1	4
Japan Industry	12	7	1	3	23	1621
Non-Japan Industry	4	7	3	1	15	306
Provider JP	57	21	4	1	83	476
Provider Asia-Paci	42	10	5	0	57	81
Provider EU AM	10	7	0	0	17	33
Unknown Japan	6	6	2	1	15	413
Unknown Asia-Paci	2	2	0	1	5	374
Unknown Latin Am	4	2	0	0	6	9
Unknown	2	1	0	0	3	4
EBI	3	0	0	0	3	3
NIH	2	1	0	0	3	5
Admin	0	1	1	0	2	23
total uniqIP	230	118	39	37	424	90540
 * Table5. Unique retrievers of release files in 2011 by recursiveness.