546 |
18 Jan 08 |
nicklas |
1 |
---------------------------------------------------------------------- |
570 |
06 Feb 08 |
martin |
2 |
{{{ |
1166 |
28 Sep 09 |
jari |
Copyright (C) 2008 Nicklas Nordborg |
1166 |
28 Sep 09 |
jari |
Copyright (C) 2009 Jari Häkkinen, Nicklas Nordborg, Martin Svensson |
570 |
06 Feb 08 |
martin |
5 |
|
570 |
06 Feb 08 |
martin |
This file is part of Illumina plug-in package for BASE. |
570 |
06 Feb 08 |
martin |
Available at http://baseplugins.thep.lu.se/ |
570 |
06 Feb 08 |
martin |
BASE main site: http://base.thep.lu.se/ |
570 |
06 Feb 08 |
martin |
9 |
|
570 |
06 Feb 08 |
martin |
This is free software; you can redistribute it and/or |
570 |
06 Feb 08 |
martin |
modify it under the terms of the GNU General Public License |
940 |
27 Jan 09 |
martin |
as published by the Free Software Foundation; either version 3 |
570 |
06 Feb 08 |
martin |
of the License, or (at your option) any later version. |
570 |
06 Feb 08 |
martin |
14 |
|
570 |
06 Feb 08 |
martin |
The software is distributed in the hope that it will be useful, |
570 |
06 Feb 08 |
martin |
but WITHOUT ANY WARRANTY; without even the implied warranty of |
570 |
06 Feb 08 |
martin |
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the |
570 |
06 Feb 08 |
martin |
GNU General Public License for more details. |
570 |
06 Feb 08 |
martin |
19 |
|
570 |
06 Feb 08 |
martin |
You should have received a copy of the GNU General Public License |
941 |
27 Jan 09 |
martin |
along with BASE. If not, see <http://www.gnu.org/licenses/>. |
570 |
06 Feb 08 |
martin |
22 |
}}} |
546 |
18 Jan 08 |
nicklas |
23 |
---------------------------------------------------------------------- |
546 |
18 Jan 08 |
nicklas |
24 |
|
570 |
06 Feb 08 |
martin |
== Requirements == |
546 |
18 Jan 08 |
nicklas |
26 |
|
2286 |
13 Mar 14 |
nicklas |
1. BASE 3.3 or later. |
627 |
13 Mar 08 |
nicklas |
28 |
|
627 |
13 Mar 08 |
nicklas |
For expression experiments: |
627 |
13 Mar 08 |
nicklas |
30 |
|
570 |
06 Feb 08 |
martin |
2. Illumina Bead Summary (IBS) files. The IBS files contain quantified probe intensities. |
1032 |
07 Apr 09 |
nicklas |
3. Illumina Sentrix Array binary manifest (BGX) file. The BGX files contain probe annotations. |
570 |
06 Feb 08 |
martin |
33 |
|
627 |
13 Mar 08 |
nicklas |
For SNP experiments: |
627 |
13 Mar 08 |
nicklas |
35 |
|
627 |
13 Mar 08 |
nicklas |
2. Illumina SNP manifest files |
627 |
13 Mar 08 |
nicklas |
3. Illumina SNP raw data files |
627 |
13 Mar 08 |
nicklas |
38 |
|
570 |
06 Feb 08 |
martin |
Tested using Illumina BeadArray Reader (Version: 1.7.0.44) and BeadScan |
570 |
06 Feb 08 |
martin |
(Version: 3.5.31.17122) ## This is what we use in Lund. |
546 |
18 Jan 08 |
nicklas |
41 |
|
570 |
06 Feb 08 |
martin |
== Introduction == |
546 |
18 Jan 08 |
nicklas |
43 |
|
627 |
13 Mar 08 |
nicklas |
This README file contains general information about the plug-in package and |
627 |
13 Mar 08 |
nicklas |
specific information about expression data. See the README_SNP file specific |
627 |
13 Mar 08 |
nicklas |
information about SNP data. |
627 |
13 Mar 08 |
nicklas |
47 |
|
570 |
06 Feb 08 |
martin |
The Illumina BeadArray Reader is a scanner that can read arrays including Illumina |
1032 |
07 Apr 09 |
nicklas |
Sentrix BeadChips and Sentrix Array Matrices (SAMs). Operation of the BeadArray |
1166 |
28 Sep 09 |
jari |
Reader and image acquisition from Sentrix arrays is handled by the Illumina BeadScan |
570 |
06 Feb 08 |
martin |
software. |
546 |
18 Jan 08 |
nicklas |
52 |
|
570 |
06 Feb 08 |
martin |
The data output from a BeadArray Reader scanner by default consists of files |
570 |
06 Feb 08 |
martin |
including image data (IDAT) files that can be read by data analysis software such |
570 |
06 Feb 08 |
martin |
as the Illumina BeadStudio software. |
546 |
18 Jan 08 |
nicklas |
56 |
|
1032 |
07 Apr 09 |
nicklas |
The Illumina plug-ins package for BASE reads Illumina Sentrix Array data from |
570 |
06 Feb 08 |
martin |
Illumina Bead Summary (IBS) files. The IBS files are not by default outputted by |
570 |
06 Feb 08 |
martin |
the BeadArray Reader and the scanner must be configured to do so. Once the |
570 |
06 Feb 08 |
martin |
BeadArray Reader is configured it will output IBS files in addition to any default |
570 |
06 Feb 08 |
martin |
output files. To configure a BeadArray Reader to output IBS files, users are asked |
570 |
06 Feb 08 |
martin |
to contact their local Illumina Field Application Scientist. |
570 |
06 Feb 08 |
martin |
63 |
|
1032 |
07 Apr 09 |
nicklas |
The IBS files are text files that contain bead-type level data for scanned Sentrix |
570 |
06 Feb 08 |
martin |
arrays. The file format is explained in detail in the section Illumina Bead Summary |
570 |
06 Feb 08 |
martin |
files. |
570 |
06 Feb 08 |
martin |
67 |
|
570 |
06 Feb 08 |
martin |
== Illumina Bead Summary (IBS) files == |
570 |
06 Feb 08 |
martin |
69 |
|
1032 |
07 Apr 09 |
nicklas |
The IBS files contain bead-type level data for scanned Sentrix arrays. |
570 |
06 Feb 08 |
martin |
The IBS files are simple comma separated text files with file extension .csv. |
570 |
06 Feb 08 |
martin |
The IBS files are outputted by the BeadArray Reader in the same directory as any |
570 |
06 Feb 08 |
martin |
additional data files from a scan. Note that IBS files are not outputted by a |
570 |
06 Feb 08 |
martin |
BeadArray Reader with default settings. Contact a local Illumina Field Application |
570 |
06 Feb 08 |
martin |
Scientist to configure the scanner to output IBS files. |
570 |
06 Feb 08 |
martin |
76 |
|
570 |
06 Feb 08 |
martin |
IBS files are composed of four comma separated columns. See below for an example |
570 |
06 Feb 08 |
martin |
IBS file including header and 3 rows of data. |
570 |
06 Feb 08 |
martin |
79 |
{{{ |
580 |
11 Feb 08 |
nicklas |
Illumicode,N,Mean GRN,Dev GRN |
580 |
11 Feb 08 |
nicklas |
10008,26,222,47 |
580 |
11 Feb 08 |
nicklas |
10010,16,57,11 |
580 |
11 Feb 08 |
nicklas |
10014,16,56,13 |
570 |
06 Feb 08 |
martin |
84 |
}}} |
580 |
11 Feb 08 |
nicklas |
85 |
|
570 |
06 Feb 08 |
martin |
The column content in an IBS file is described below. |
570 |
06 Feb 08 |
martin |
- ''' Illumicode ''': A code corresponding to the Array_Address_Id in the |
1166 |
28 Sep 09 |
jari |
Illumina Sentrix Array binary manifest (BGX) file. Note that the Illumicode |
1032 |
07 Apr 09 |
nicklas |
is a string (or integer) of varying length. The Array_Address_Id is a string with |
1166 |
28 Sep 09 |
jari |
a fixed length of 10 characters that consists of an Illumicode padded with zeros. |
570 |
06 Feb 08 |
martin |
- ''' N ''': The total number of beads used to calculate Mean GRN and Dev GRN. |
570 |
06 Feb 08 |
martin |
- ''' Mean GRN ''': The mean intensity. |
570 |
06 Feb 08 |
martin |
- ''' Dev GRN ''': Standard deviation of the mean intensity. |
570 |
06 Feb 08 |
martin |
94 |
|
1166 |
28 Sep 09 |
jari |
IBS files may contain some rows with Illumicode that are not represented in the |
1093 |
27 May 09 |
martin |
BGX files. |
1093 |
27 May 09 |
martin |
Our interpretation is that some probes that used to be annotated with a gene has |
1093 |
27 May 09 |
martin |
later been considered poor by Illumina (as we have only observed an increase |
1166 |
28 Sep 09 |
jari |
in the number of unmatched probes for later BGX revisions). To avoid that the |
1093 |
27 May 09 |
martin |
raw data importer fails cause of some probes, listed in the IBS-file, couldn't |
1093 |
27 May 09 |
martin |
be found in BASE - you have to set plug-in parameter 'Probe not found=skip', |
1093 |
27 May 09 |
martin |
when importing data from an IBS-file into BASE. |
1093 |
27 May 09 |
martin |
103 |
|
580 |
11 Feb 08 |
nicklas |
A new raw data type has been defined in illumina-raw-data-types.xml |
580 |
11 Feb 08 |
nicklas |
to hold this kind of data. The name of the raw data type is |
580 |
11 Feb 08 |
nicklas |
'''Illumina Bead Summary (IBS)''' and the unique ID is '''illumina_bead_summary''' |
580 |
11 Feb 08 |
nicklas |
107 |
|
580 |
11 Feb 08 |
nicklas |
108 |
|
1032 |
07 Apr 09 |
nicklas |
== Illumina Sentrix Array binary manifest (BGX) files == |
570 |
06 Feb 08 |
martin |
110 |
|
1032 |
07 Apr 09 |
nicklas |
In addition to IDAT files, BeadStudio requires Illumina Sentrix Array binary |
570 |
06 Feb 08 |
martin |
manifest (BGX) files that contain information about the probes on a specific |
1032 |
07 Apr 09 |
nicklas |
Illumina Sentrix Array, including gene symbol, probe sequence, and so on. |
570 |
06 Feb 08 |
martin |
In BASE, the BGX files are used to create array designs that describe the probe |
1032 |
07 Apr 09 |
nicklas |
content of a specific Illumina Sentrix Array. |
570 |
06 Feb 08 |
martin |
116 |
|
570 |
06 Feb 08 |
martin |
BGX files are tab separated text files composed of 3 sections named Heading, |
570 |
06 Feb 08 |
martin |
Probes, and Controls respectively. The first section is the Heading section. |
1166 |
28 Sep 09 |
jari |
It is preceded by a row containing the text [Heading]. In the Heading section |
570 |
06 Feb 08 |
martin |
some information is presented including the number of Probes and Controls |
570 |
06 Feb 08 |
martin |
described in the BGX file. See below for an example of the Heading section. |
570 |
06 Feb 08 |
martin |
122 |
{{{ |
580 |
11 Feb 08 |
nicklas |
[Heading] |
580 |
11 Feb 08 |
nicklas |
Date 1/3/2007 |
580 |
11 Feb 08 |
nicklas |
ContentVersion 1.0 |
580 |
11 Feb 08 |
nicklas |
FormatVersion 1.0.0 |
580 |
11 Feb 08 |
nicklas |
Number of Probes 48701 |
580 |
11 Feb 08 |
nicklas |
Number of Controls 1426 |
570 |
06 Feb 08 |
martin |
129 |
}}} |
1166 |
28 Sep 09 |
jari |
Following the Heading section is the Probes section which is preceded by a row |
570 |
06 Feb 08 |
martin |
containing the text [Probes]. The first row of the Probes section, i.e., the row |
570 |
06 Feb 08 |
martin |
after [Probes] contain the header for the Probes section. Following the Probes |
1166 |
28 Sep 09 |
jari |
section is the Controls section which is preceded by a row containing the text |
570 |
06 Feb 08 |
martin |
[Controls]. The first row of the Controls section, i.e., the row after [Controls] |
570 |
06 Feb 08 |
martin |
contain the header for the Controls section. Note that the header row for the |
570 |
06 Feb 08 |
martin |
Controls section is completely different that the header row for the Probes section. |
570 |
06 Feb 08 |
martin |
See below for an example of Probes header and Controls header and how information |
570 |
06 Feb 08 |
martin |
in the BGX file is mapped to BASE. |
570 |
06 Feb 08 |
martin |
139 |
|
570 |
06 Feb 08 |
martin |
== Mapping reporter/control annotations from BGX files to BASE == |
570 |
06 Feb 08 |
martin |
141 |
|
580 |
11 Feb 08 |
nicklas |
The table below shows how the [Probes] section in the BGX file are mapped to |
580 |
11 Feb 08 |
nicklas |
reporter annotations in BASE. Annotations in <brackets> are new annotations |
580 |
11 Feb 08 |
nicklas |
defined in the illumina-extended-properties.xml file. BGX columns marked |
580 |
11 Feb 08 |
nicklas |
with - are not mapped to BASE. |
570 |
06 Feb 08 |
martin |
146 |
|
580 |
11 Feb 08 |
nicklas |
|| '''BGX column''' || '''BASE reporter annotation''' || '''Example value''' || |
580 |
11 Feb 08 |
nicklas |
|| Species || Species || Homo sapiens || |
580 |
11 Feb 08 |
nicklas |
|| Source || <Source> || RefSeq || |
580 |
11 Feb 08 |
nicklas |
|| Search_Key || <Search_Key> || ILMN_5998 || |
580 |
11 Feb 08 |
nicklas |
|| Transcript || - || ILMN_5998 || |
580 |
11 Feb 08 |
nicklas |
|| ILMN_Gene || <ILMN_Gene> || BRCA1 || |
580 |
11 Feb 08 |
nicklas |
|| Source_Reference_ID || <Source_Reference_ID> || NM_007301.2 || |
580 |
11 Feb 08 |
nicklas |
|| RefSeq_ID || RefSeq || NM_007301.2 || |
580 |
11 Feb 08 |
nicklas |
|| Unigene_ID || Cluster ID || || |
580 |
11 Feb 08 |
nicklas |
|| Entrez_Gene_ID || LocusLink || 672 || |
580 |
11 Feb 08 |
nicklas |
|| GI || - || 63252878 || |
580 |
11 Feb 08 |
nicklas |
|| Accession || Accession || NM_007301.2 || |
580 |
11 Feb 08 |
nicklas |
|| Symbol || Gene symbol || BRCA1 || |
580 |
11 Feb 08 |
nicklas |
|| Protein_Product || - || NP_009232.1 || |
580 |
11 Feb 08 |
nicklas |
|| Probe_Id || External ID || ILMN_1738027 || |
580 |
11 Feb 08 |
nicklas |
|| Array_Address_Id || Feature ID * || 0003120095 || |
580 |
11 Feb 08 |
nicklas |
|| Probe_Type || <Isoform_Type> || A || |
580 |
11 Feb 08 |
nicklas |
|| Probe_Start || - || 6438 || |
580 |
11 Feb 08 |
nicklas |
|| Probe_Sequence || Sequence || ATCCAGGACTGTTTATAGCTGTTGGAAGGACTAGGTCTTCCCTAGCCCCC || |
580 |
11 Feb 08 |
nicklas |
|| Chromosome || Chromosome || 17 || |
580 |
11 Feb 08 |
nicklas |
|| Probe_Chr_Orientation || <Probe_Chr_Orientation> || || |
580 |
11 Feb 08 |
nicklas |
|| Probe_Coordinates || <Probe_Coordinates> || 38449935-38449984 || |
580 |
11 Feb 08 |
nicklas |
|| Definition || Description || Homo sapiens breast cancer 1, early onset (BRCA1), transcript variant BRCA1-delta15-17, mRNA. || |
580 |
11 Feb 08 |
nicklas |
|| Ontology_Component || GO cell location || ubiquitin ligase complex [goid 151] [pmid 14976165] [evidence NAS]; ... || |
580 |
11 Feb 08 |
nicklas |
|| Ontology_Process || GO biological process || protein ubiquitination [goid 16567] [pmid 15905410] [evidence NAS]; ... || |
580 |
11 Feb 08 |
nicklas |
|| Ontology_Function || GO molecular function || metal ion binding [goid 46872] [evidence IEA]; ... || |
580 |
11 Feb 08 |
nicklas |
|| Synonyms || <Synonyms> || IRIS; PSCP; BRCAI; BRCC1; RNF53 || |
570 |
06 Feb 08 |
martin |
174 |
|
580 |
11 Feb 08 |
nicklas |
The table below shows how the [Controls] section in the BGX file are mapped to |
580 |
11 Feb 08 |
nicklas |
reporter annotations in BASE. Annotations in <brackets> are new annotations |
580 |
11 Feb 08 |
nicklas |
defined in the illumina-extended-properties.xml file. BGX columns marked |
580 |
11 Feb 08 |
nicklas |
with - are not mapped to BASE. |
570 |
06 Feb 08 |
martin |
179 |
|
580 |
11 Feb 08 |
nicklas |
|| '''BGX column''' || '''BASE reporter annotation''' || '''Example value ''' || |
580 |
11 Feb 08 |
nicklas |
|| Probe_Id || External ID || ILMN_943471 || |
580 |
11 Feb 08 |
nicklas |
|| Array_Address_Id || Feature ID * || 0004780609 || |
580 |
11 Feb 08 |
nicklas |
|| Reporter_Group_Name || <Control_Group_Name> || housekeeping || |
580 |
11 Feb 08 |
nicklas |
|| Reporter_Group_id || <Control_Group_Id> || housekeeping || |
580 |
11 Feb 08 |
nicklas |
|| Reporter_Composite_map || <Control_Composite_map> || GI_34304116-S || |
580 |
11 Feb 08 |
nicklas |
|| Probe_Sequence || Sequence || CGTGAAGACCCTGACTGGTAAGACCATCACTCTCGAAGTGGAGCCGAGTG || |
570 |
06 Feb 08 |
martin |
187 |
|
580 |
11 Feb 08 |
nicklas |
* The Feature ID is not a reporter annotation. It is used only to |
580 |
11 Feb 08 |
nicklas |
identify the probe on an array design. |
580 |
11 Feb 08 |
nicklas |
190 |
|
580 |
11 Feb 08 |
nicklas |
The column mappings for the [Probes] section can be changed by modifying |
580 |
11 Feb 08 |
nicklas |
the existing import configuration or creating a new configuration. The |
580 |
11 Feb 08 |
nicklas |
column mappings for [Controls] section can't be changed. |
580 |
11 Feb 08 |
nicklas |
194 |
|
580 |
11 Feb 08 |
nicklas |
== Getting started == |
580 |
11 Feb 08 |
nicklas |
196 |
|
1166 |
28 Sep 09 |
jari |
1. Install this package as described by the instructions in the |
1166 |
28 Sep 09 |
jari |
INSTALL file. |
1166 |
28 Sep 09 |
jari |
2. Optionally, if not already done, disable use of the 'Illumina' raw |
1166 |
28 Sep 09 |
jari |
data type provided by a standard BASE installation; Simply remove, |
1166 |
28 Sep 09 |
jari |
or comment out, the `<raw-data-type>` entry for `id='illumina'` in |
1166 |
28 Sep 09 |
jari |
configuration file |
1166 |
28 Sep 09 |
jari |
`/path/to/base/WEB-INF/classes/raw-data-types.xml` and restart the |
1166 |
28 Sep 09 |
jari |
BASE server. If a user already used the 'Illumina' raw data type, |
1166 |
28 Sep 09 |
jari |
BASE will display an error message in the GUI. Either restore the |
1166 |
28 Sep 09 |
jari |
use of 'Illumina' raw data type or remove the data using |
1166 |
28 Sep 09 |
jari |
'Illumina' raw data type. |
1166 |
28 Sep 09 |
jari |
3. Import reporter annotations. You will need one or more BGX files for this. |
583 |
12 Feb 08 |
nicklas |
BGX files can be downloaded from http://www.switchtoi.com/annotationfiles.ilmn. |
580 |
11 Feb 08 |
nicklas |
* Upload the BGX file(s) to BASE. |
580 |
11 Feb 08 |
nicklas |
* Go to the View -> Reporters menu. |
580 |
11 Feb 08 |
nicklas |
* Click on the Import button. |
580 |
11 Feb 08 |
nicklas |
* Use the auto-detect function or select the Illumina BGX reporter importer plug-in. |
580 |
11 Feb 08 |
nicklas |
* Select the BGX file. |
580 |
11 Feb 08 |
nicklas |
* Finish the job registration and wait for the plug-in to complete. |
580 |
11 Feb 08 |
nicklas |
* Repeat this one time for each BGX file. |
1166 |
28 Sep 09 |
jari |
4. Create array designs. You will need one array design for each BGX file. |
580 |
11 Feb 08 |
nicklas |
* Go to the Array LIMS -> Array designs menu. |
580 |
11 Feb 08 |
nicklas |
* Click on the New button. |
580 |
11 Feb 08 |
nicklas |
* Choose the Illumina/Expression 1 or the Illumina/Expression 2 platform. The difference |
580 |
11 Feb 08 |
nicklas |
is that the Expression 2 has two IBS files for each raw data set, but Expression 1 |
580 |
11 Feb 08 |
nicklas |
only has one. |
580 |
11 Feb 08 |
nicklas |
* We recommend that you give the array design the same name as the BGX file. |
580 |
11 Feb 08 |
nicklas |
* Switch to the Data files tab and select the BGX file. |
580 |
11 Feb 08 |
nicklas |
* Click on Save. |
580 |
11 Feb 08 |
nicklas |
* Click on the newly created array design. |
580 |
11 Feb 08 |
nicklas |
* Click on the Import button and select the Illumina BGX feature importer plug-in. |
580 |
11 Feb 08 |
nicklas |
* Click on Next and select the Duplicate feature=skip option. |
580 |
11 Feb 08 |
nicklas |
* Finish the job registration and wait for the plug-in to complete. |
580 |
11 Feb 08 |
nicklas |
* Repeat this for each BGX file. |
1166 |
28 Sep 09 |
jari |
5. Import raw data. You will need one or two IBS files. |
580 |
11 Feb 08 |
nicklas |
* Upload the IBS file(s) to BASE. |
580 |
11 Feb 08 |
nicklas |
* Go to the View -> Raw bioassays menu. |
580 |
11 Feb 08 |
nicklas |
* Click on the New button. |
580 |
11 Feb 08 |
nicklas |
* Select the Illumina/Expression 1 or the Illumina/Expression 2 platform. The difference |
580 |
11 Feb 08 |
nicklas |
is that the Expression 2 has two IBS files for each raw data set, but Expression 1 |
580 |
11 Feb 08 |
nicklas |
only has one. |
580 |
11 Feb 08 |
nicklas |
* Select one of the array designs created in step 3. |
580 |
11 Feb 08 |
nicklas |
* Switch to the Data files tab and select the IBS file(s). |
580 |
11 Feb 08 |
nicklas |
* Click on Save. |
580 |
11 Feb 08 |
nicklas |
* Click on the newly created raw bioassay. |
580 |
11 Feb 08 |
nicklas |
* Click on the Import button and select the Illumina Bead Summary Importer |
580 |
11 Feb 08 |
nicklas |
* Finish the job registration and wait for the plug-in to complete. |
580 |
11 Feb 08 |
nicklas |
* Repeat this for each set of raw data files. |
1166 |
28 Sep 09 |
jari |
6. Add your raw data sets to an experiment. |
580 |
11 Feb 08 |
nicklas |
246 |
|
1196 |
05 Mar 10 |
jari |
Tip! Steps 1-4 only needs to be done a single time for a BASE |
1196 |
05 Mar 10 |
jari |
installation. If more than one user is going to use the Illumina |
1196 |
05 Mar 10 |
jari |
package we recommend that the array designs created in step 4 are |
1196 |
05 Mar 10 |
jari |
shared to the appropriate users, for example, the Everyone group. |
580 |
11 Feb 08 |
nicklas |
251 |
|
1196 |
05 Mar 10 |
jari |
Tip! The data import step in (5) above can be done for an entire |
1196 |
05 Mar 10 |
jari |
experiment at a time. |