The data management and interpreta`on challenge

advertisement
From personal genomics to personal diagnos1cs The data management and interpreta.on challenge Program 11:30 – 11:50 Terry Vrijenhoek Personal genomics -­‐ diagnos1c DNA sequencing 11:50 – 12:10 Peter Walgemoed The Dutch Health Hub 12:10 – 12:30 Maud Radstake Responsible Data Management for Personalised Diagnos1cs Personal genomics Diagnos.c DNA sequencing DNA developments 1953 2001 2010 2012 Utrecht DNA sequencing facility Hubrecht
Institute
WKZ
Children’s Hospital
UMC Utrecht
Stratenum
“It is 'me to develop guidelines to harmonise further implementa'on [of NGS into diagnos'cs]” – Work program Health Council “The primary goal of this program is to develop infrastructure, procedures and guidelines for NGS-­‐based personalized diagnos'cs for na'on-­‐
wide implementa'on.” – Grant applica1on CGD CARDIO – pilot project •  10 (9) pa.ents with cardiomyopathy •  DNA and medical reports •  Iden.fy need for policy Four phases; five topics 1.  Intake – samples and medical reports 2.  Sequencing – library prepara.on, enrichment and sequencing procedures 3.  Data analysis – tools and pipelines 4.  Diagnosis – interpreta.on and counseling 5.  Ethical and legal framework – consent, pa.ent rights, data protec.on, liability GENOME EXOME CARDIO GENE SET Detected variants !"#"
89&'
9BC(D3
9BC(D3
9BC(D3
9BC(D3
9BQ7
=&&R3
(8&
==&=2
=&&=2
'D=&2
TBT2
TBT2
9B(&
8%C3
'CDDI
9BQ7
9BQ7
XY(
%JD2
==&
==&
==&
==&
==&
==&
==&
==&
==&
==&
==&
==&
==&
==&
X(Q2
9B83
!8'
='Z
%9%
$%&'
$:113;!<=
$:2737E1!<'
$:2374H2375,#FD
$:23730?>
$:I27P2'<!
$:26;I!<'
$:4I7D<=
$:4;H420"G'!'
$:662=<D
$:6A3=<D
$:AI3!<'
$:6555E17D<=
$:A162=<D
$:1134D<=
$:1;A3'<D
$:366I!<'
$:4172'<D
$:2I45=<D
$:I;IE6D<=
$:A;2'<!
$:74571!<D
$:71;47D<=
$:6I;57'<!
$:6I;3ID<=
$:64;6;!<'
$:2I767!<'
$:253I7=<D
$:2;A;3!<D
$:I442D<=
$:I3;4D<=
$:I221!<'
$:7;2A=<!
$:776D<=
$:2I6P14=<D
$:64A!<'
$:476D<=
$:427!<'
$:5I4D<=
$:5723'<=
()*+",#
>:')-3778"?
F>G,$"
>:=)>7I2J")KF
>:=)>7I2KF/L/>:=)>7I2MNGKFO17
F>G,$"
>:')-A7;Q,F
>:J")166(S"
>:')-140"G
>:RG"221=S)
>:RG"22A=S)
>:')-2IAQ,F
>:RG"37ARG"
>:RG"2721=S)
>:U
>:=S)1223=S)
>:8WF13I1=S)
>:9"+IA2=S)
>:=S)26A'GN
>:!G#24A57Q,F
>:')-236A3DWF
>:U
>:'GN21354=S)
>:MNGII23RG"
>:MNGA466'GN
>:'F>6I35Q,F
>:')-314ADWF
>:')-31;2DWF
>:')-3;74!G#
>:9"+2343')>:()*25I8"?
>:8WF2168WF
>:=S)15I9"+
>:'GN143=S)
>:U
>:'F>1I;AMNG
-"#*.,$/$**)0
1@1561;5AA5
11@47357427
11@4735I27IH4735I2A;
11@4735I2A;
11@47367I23
14@23AI4;4A
1I@5566545;
6@11AAA;124H11AAA;126
1@2;1331;6A
1@2;1331;6A
1@236I;261A
1@2377I4A5A
1@237A21276
1;@6II;A113
1;@AA452311V
12@21IA1AI2
14@23AA6AI3
14@23AI2I1;
17@3II23625
1A@2A66667I
2@17I3II576
2@17I4;455;
2@17I4;A61I
2@17I4;A637
2@17I416372
2@17I463475V
2@17I473;1A
2@17I4A161A
2@17I631231V
2@17I632515
2@17I6325IA
2@17I63A72I
2@17I664352
2@17I665423
2;@427AA77I
3@46I;;I7;
O@1;;65674;
O@15364A3A1
O@32361267
1
1
;:5
1
1
1
1
1
1
1
1
;:5
1
1
;:1
&'
&'
1
1
1
1
&'
&'
&'
&'
&'
&'
&'
&'
&'
&'
&'
&'
&'
&'
&'
1
1
1
1
2
3
4
5
6
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
&'
&'
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
&'
&'
1
1
1
1
1
1
1
1
&'
;:5
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
&'
1
1
1
1
1
1
1
1
1
1
1
1
&'
1
1
1
&'
1
1
1
1
1
1
1
1
&'
1
1
1
1
1
1
7
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
&'
1
1
1
1
1
1
&'
&'
&'
&'
&'
&'
&'
&'
&'
&'
&'
&'
&'
&'
&'
1
1
1
&'
1
1
1
1
1
&'
&'
&'
&'
&'
&'
&'
&'
&'
&'
&'
&'
&'
&'
1
&'
General observa1on The diagnos.c outcomes for the CARDIO pa.ents are generally similar… …despite the differences in sequencing approach, plaQorm and data analysis So, do we need guidelines? Numbers 3,154,971,228 56,217,673,809 2,885,536,685 2,208,479,859 2,155,162 114,938 5,585 937 111 112 ? Bases in human genome assembly Bases in SOLiD run (2 slides paired-­‐end) 18x Bases covered at least once (93.6%) Bases covered at least 10 x (~70%) SNPs Indels Non-­‐synonymous coding variants Splice sites affected Essen.al splice site affected Stop codon introduced Post-­‐CARDIO RESEARCH PROPOSALS FROM COMMUNITY P-­‐o-­‐C CENTRAL DATA INFRASTRUCTURE •  Ini.a.ve from EMC and LUMC in collabora.on with SURF/
DHH •  Exploring opportuni.es for central data infrastructure •  Features to test: •  Data transport •  Access to central storage •  Virtualisa.on •  Authen.ca.on •  Legal aspects TOWARDS RESPONSIBLE DATA MANAGEMENT IN PERSONALISED DIAGNOSTICS •  Collabora.on with Centre for Society and Genomics •  Mee.ng of minds: technology, clinic, ethics, society, legal •  To facilitate a proposal for a (European) program on societal responsible diagnos.c use of large-­‐scale biomedical data •  Kick-­‐off October 1 Thank you Marcel Mannens Ronald Lekanne Dit Deprez Raoul Hennekam Elcke Kranendonk Correne Ploem Wilbert van Workum Kees van der Berg Derek Butler Adalberto Costessi Bas Reichert Marjolein Kriek Johan den Dunnen Ken Kraaijeveld Gijs Santen Claudia Ruivenkamp Nienke van der Stoep Erik Sistermans Bauke Ylstra Remond Fijneman Jannek Weiss Quinten Waisfisz Frans Hogervorst Hans Kris.an Ploos van Amstel Jasper Saris Marjon van Slegtenhorst Wilfred van Ijcken Ingrid van der Laar Marja Wessels Maarten van den Berg Richard Sinke Morris Swertz Peter van Tintelen Jan Jongbloed Birgit Sikkema Marian Verkerk Nine Knoers Wim Dorlijn Winfried van Eijndhoven Rogier Drenth Gijs van Haaoen Edwin Cuppen Maartje Vogel Bert van der Zwaag Ellen van Binsbergen Mariëlle Swinkels Patrick van Zon Marcel Nelen Chris.an Gilissen Joep de Ligt Joris Veltman Helger Ijntema Ilse Feenstra Steven van Hove Ewout Ouwerkerk Bart de Koning Ingrid Kapels Bert Smeets Arthur van der Wijngaard Rick Kamps Suzanne Frints Marco Rijnen Jan Ghyssaert Simone Guenther Maud Radstake Wybo Dondorp Guido de Wert Tonnie Rijkers Colja Laane Graciela Mar.nez 
Download