GeneScreen 
Walk-through guide
Note: This program requires the Microsoft 
.NET framework version 2 or later to be installed on your computer. 
This walk-through of the program uses the sample files in the GeneScreen_sample_files folder. There are five main sections 
to this guide: 
  - The creation of reference sequence files 
 
  - Searching for mutations/SNPs in multiple ABI files 
 
  - Searching a sequence for heterozygous indels 
 
  - Exporting data to a file or database (LOVD) 
 
  - Analyzing a single sequence file 
  
1. Creating reference sequences
  - In this Section:
  
  - 1.1. Importing mRNA sequence information 
 
  - 1.2. Importing genomic sequence information 
 
  - 1.3. Primer design 
 
  - 1.4. Choosing the PCR products 
  
  
Figure 1: The start-up screen.  
When the GeneScreen program starts, the tab for 
analysis of sequences is initially shown (Figure 1). This is because this tab 
will most likely be used multiple times for analysis of ABI sequences. The 
creation of a set of reference files, in contrast, will in general be done only 
once. 
To create reference sequences, click the Reference files tab at the right of the tab-bar (Figure 
2). 
 
  
Figure 2: Reference sequence production window  
Reference sequences can simply be plain text sequence files. However, if you 
wish to screen a gene whose cDNA and genomic sequences are both available, you 
can make reference files that contain additional information on the open reading 
frame, exons/introns and protein sequence. 
 
1.1. Importing mRNA sequence information
  
Figure 3: Selecting the correct ORF  
To create an annotated sequence file, press the mRNA button (Figure 2) and select the file in the BRCA2 source sequences folder called cDNA.txt. Then, press the “Get ORF position” Select button. This will screen the sequence for open 
reading frames (Figure 3). The window in Figure 3 displays six different sets of 
coloured lines, representing the possible reading frames. Only the forward 
frames are available for selection. If the desired ORF is on the reverse 
complement strand, close the window and press the Invert button (Figure 2, three buttons below mRNA) and then press the Select 
button again. 
To select an ORF, place the cursor over the appropriate line (in Figure 3 
it's the longest (yellow) line) and click the left mouse button. The selected 
line will then become hollow. If you want to check that you indeed have the 
correct ORF, press the Check button, whereupon the 
window in Figure 4 will be displayed. This form allows you to select options for 
generating an annotated text file showing the protein and mRNA sequence 
alignment. The format of this output file will change according to the options 
selected from the form, and will be shown in preview form in the bottom left of 
the window. (The sequence displayed is a generic one, not the actual selected 
sequence.) To generate the output file, press the Create button. To accept the ORF, close the window shown in 
Figure 4 (if opened) and confirm by clicking the Accept button (Figure 3). 
 
  
Figure 4: Checking the ORF  
If you do not wish to include any genomic sequence in the reference files, 
you can create the reference sequences now, by either designing or importing a 
list of primers that you intend to use in the PCR amplification/sequencing of 
the mRNA. This process is the same as for creating files with genomic sequence 
information, so this step is illustrated later. 
 
1.2. Importing genomic sequence information
  
Figure 5: Finding exons  
Just as reference files may be created without genomic sequence information, 
they may be created without mRNA sequence information, by omitting the steps in 
Section 1.1. To import a genomic sequence, press the button labelled Genomic (Figure 2) and select a plain text sequence file 
corresponding to the genomic region (including 5'-UTR, 3'-UTR, introns and 
exons). For the purposes example, use the file called Genomic.txt. To find the positions of exons, press the Find button (only if an mRNA sequence has already been 
added). This will display the window shown in Figure 5. If you get a message 
that says no alignment was found, press the Invert 
button (three buttons below the Genomic button) and 
try again. 
The upper green box represents the position of the ORF within the mRNA, while 
the blue triangles show the regions of sequence alignment between the mRNA 
(upper line) and the genomic sequence (lower line). To check the alignment, 
press the Check button and save the alignment to disk. 
(To reduce the size of the alignment file, unaligned intronic and intragenic 
sequence is removed, apart from short regions that flank the exons.) If the 
alignment is correct, press the Accept button. 
 
1.3. Primer design
Once you have created the alignments, the relevant Primers buttons shown in Figure 2 will be enabled, 
allowing you to design primers. However, if you wish to design primers using a 
different software package, you can alternatively import primers as plain text, 
in the tab-delimited format used in the Primers.txt 
file. To import primers, under Primer options 
press the Primers button (third from the bottom). Or, 
to design primers using GeneScreen, press one of the 
other Primers buttons (either under mRNA options or Genomic DNA options) which will cause the following 
form to be displayed: 
 
  
Figure 6: Primer design  
The primer design window contains three regions. 
The upper region (A in Figure 6) shows the calculated Tm 
for primers (red line, indicated by blue arrow) of the currently selected length 
as shown in the text box (blue ellipse). 
The middle region (B) shows the sequence currently in view (including 
any protein translation where appropriate) and the lower panel (C) shows 
a scaled graphic of the total genomic region. In these panels, green areas 
represent protein-coding regions and blue areas indicate 5'- and 3'-UTR 
sequences. The lower panel shows all the exons etc. (In this image the exons are 
difficult to see.) The positions of forward and reverse primers are shown by 
blue and red lines respectively. 
The user can navigate along the sequence either using the scroll bar (red 
arrow) or the < and > 
buttons (red ellipse). The latter buttons move the display to the next sequence 
feature i.e. an intron/exon splice site, start/stop codon or start/end of 
the mRNA. 
To create a primer, place the cursor over the primer site and click the left 
mouse button. The chosen primer’s sequence and Tm will be displayed 
in the title bar of the window, and the corresponding sequence in panel B will 
be coloured red. The primer length can be altered using the text box highlighted 
with a blue ellipse in Figure 6. After entering a name for the primer, tick the 
box labelled “Reverse strand”, if the primer is for the reverse strand, and 
finally click the Add button. The chosen primer will 
now appear in the Primers panel (labelled D in Figure 6); as in panel C, 
forward primers appear in blue and reverse primers in red. 
To save the primers, press the Close button. 
 
1.4. Choosing the PCR products
  
Figure 7: Selection of PCR products  
Once primers have been entered, either selected as described above or 
imported from a file, the set of PCR products can be chosen, which it is desired 
to convert into reference files. To do this, click either the RT PCR or PCR button (Figure 2). The RT PCR option will look for PCR products within the 
mRNA sequence, while the PCR option screens the 
genomic sequence. Selecting one of these buttons generates a form of the type 
shown in Figure 7: initially, the large text box will be empty. 
Adjust the minimum and maximum product size values, so that excessively long 
or short products are ignored, and then press the Find 
button. The window will then display a list of possible PCR products. From this 
list, select the products that you wish to convert into files, and click Accept. The form will close and the Save button will be activated. Pressing this button will 
prompt the user to select a folder to save the reference files to, and then 
generate these files, along with a file containing the names and sequences of 
the primers used to create these files. 
2. Searching for mutations or SNPs in multiple ABI files
  - In this Section: 
 
  - 2.1. Selecting sequences and analysis parameters 
 
  - 2.2. Interacting with the Results grid 
 
  - 2.3. The View window 
 
  - 2.4. The Edit form 
 
  - 2.5. The Annotate form 
 
  - 2.6. Genotyping a SNP 
 
  - 2.7. Analysis of  rejected sequences 
  
  
2.1. Selecting sequences and analysis parameters
To search a number of ABI trace files (*.AB1) for mutations, place them all 
in a single folder and create a reference file. If you choose to use an 
unannotated file, the sequence should have a maximum length of 1000 bp. Select 
the Sequences tab (Figure 1) and use the 
upper Select button to pick the desired reference 
file. Next, use the lower Select button to pick the 
folder containing the trace files. This folder should be on a local hard disk, 
and not a network drive, since the operating system may restrict the program’s 
access to these files. Select the “heterozygous cut-off” value; this refers to 
the ratio of the two allelic peak heights at a heterozygous position. If the 
trace files have a high background, this cut-off value should be increased in 
order to reduce the number of false positives.  
Also, through the next drop-down listbox on the Sequences tab (Figure 1), it is possible 
to set a maximum permitted number of sequence variants per trace. If this number 
is exceeded, that trace file is deemed to have “failed” (see Section 2.7). 
If a sequence cannot be aligned to the reference file, it can be checked for 
the presence of a heterozygous indel. To use this option, tick the last checkbox 
on the Sequences tab (Figure 1). 
  
Figure 8: Selecting a region of interest for 
mutations  
To restrict sequence variant highlighting to a particular region of the 
reference file, press the Limit button, whereupon the 
window shown in Figure 8 will appear. The solid black bar represents the 
reference sequence, while the pale blue and green bars represent any exons or 
protein coding sequences (respectively) in the reference file. The selected 
region lies between the blue and red vertical lines, the positions of which can 
be moved either by positioning the cursor over the image and pressing a mouse 
button, or by changing the base pair positions selected in the list-boxes at the 
bottom right of the window. The chosen region is accepted by pressing the OK button. 
 
Finally, (returning to the Sequences tab) press Analyse. The progress of the program will be shown in the 
progress bar below the Analyse button. When all the 
trace files have been screened, the number of files that were aligned will be 
displayed in a message above the progress bar on the Sequences tab (Figure 1). 
  
Figure 9a: The results data set  
When analysis is complete, the program will switch to the Results tab (Figure 9a). Unlike the other tab views, this can 
be resized (Figure 
9b) to allow the user to see more data. 
Figures 9a and 9b show the appearance of the grid after analysis of the 
sample files contained in the folder BRCA2 exon 23, using reference file 23f to 23r start 73141 length 269.ref 
(also in this folder). 
In this display, each row represents a file, and each column a basepair 
position that contains at least one (possible) mutation in the analysed 
sequences. The number at the top of each column identifies the position of the 
(possible) mutation within the reference file. If you already chose, as 
described above, to 
define a region of interest, any sequence variant located outside the chosen 
region will ignored. 
To identify the file represented by each row, move the cursor over the blue 
bar at the left of the window. If the number of rows or columns exceeds the 
viewable area of the form, the grid can be moved using the vertical and 
horizontal scroll bars, at the right and bottom of the grid. Each cell in the 
grid is colour coded: 
 
  
  
     | 
    Nucleotides that lie within an exon are blue. |  
  
     | 
    Light grey represents positions that were not aligned 
    in that trace file. |  
  
     | 
    Dark grey cells represent sequences that could not be 
    aligned at all to the reference sequence. |  
  
     | 
    Red cells identify mutant nucleotides. |  
  
 
Moving the cursor over the cells in the data grid causes a small window to 
appear, displaying additional information about the underlying nucleotide, such 
as any predicted amino acid change. Clicking the left mouse button invokes a 
floating menu, as shown (Figure 9a), from which each menu item causes a separate 
window to be displayed, as described below 
  
Figure 9b: Results tab window is 
resizable  
This figure shows the appearance after analysis of the files in the folder 
BRCA2 exon 23, using reference file 23f to 23r start 73141 length 269.ref, 
also in this folder. 
 
2.2. Interacting with the ‘Results’ grid
The program has a number of features designed to speed up the screening of 
large numbers of sequences for possible mutations. These functions fall into two 
main categories: 
  - Dynamic display of the underlying trace data 
 
  - Updating the Results grid to display the action 
  taken on a sequence variant 
  
 
2.2.1. Dynamic display of trace data
  
Figure 10: Dynamic display of sequence flanking a possible 
mutation  
Pressing the SNP button (to the lower right on the 
Results tab) displays the View 
window (Figure 10). This window is described in greater detail below; however, 
in this mode, the image in the window is dynamically linked to the cursor 
position over the Results grid. As the cursor moves from 
one cell to another, the image in the View window thus 
changes to display the variant position and flanking sequence corresponding to 
that cell.  
 
  
Figure 11a: Selecting control traces to compare with the 
query sequence  
Alternatively, if you also want to compare the variant position to a 
corresponding control sequence, click SNP/C. This 
button operates in a similar manner to the previous one, but before the 
dynamically linked View window is displayed, a control 
sequence must be selected (Figure 11a). Typically, one would select both a 
forward and reverse control sequence, but it is possible to select just one 
trace. 
 
  
Figure 11b: Dynamic display of a possible mutation compared 
to a control trace  
As before, the trace shown as the upper image (Figure 11b) is now linked to 
the position of the cursor, while the lower image displays the control sequence 
trace at the same position. (If you selected both forward and reverse control 
sequences, the correct control sequence, matching the direction of the test 
sequence, will be displayed as the control image.) 
 
2.2.2. Editing the ‘Results’ grid to display tasks performed on a 
mutation
In the Results grid, the sequences are arranged in 
alphabetical order. Typically, therefore, if a patient ID or name is the first 
variable part of the file name (e.g. ID2323_Rev_Brac1_exon23.ab1 
or exon1_AFM_f.abi), the list will alternate between forward and reverse 
sequences for each individual. However, if the sequence direction is placed 
before the patient ID in the file name (e.g. 
Brac1_exon23_Rev_.ID2323ab1 or exon1_f_AFM.abi), all forward 
sequences will be grouped together, followed by the reverse sequences. Finally, 
if the first variable part of the file name is related neither to the sequence 
direction or patient ID, the trace will appear to be randomly placed in the 
Results grid. 
  
Figure 12: True mutations appear as pairs in alternating 
forward and reverse reads  
Figure 12 shows a region of a typical grid in which the sequences alternate 
between forward and reverse reads for each individual. Two red cells indicating 
possible mutations appear together, suggesting that they represent a true 
sequence variant, seen both in forward and reverse direction. To confirm that 
they do belong to the same individual, it is possible simply to view the 
underlying sequence data (as shown above) and read the file names, shown in the 
form title-bar. Alternatively, clicking the Link 
button (centre bottom of the grid view in Figure 9a) draws a blue 
box around the matching pairs. If forward and reverse sequences alternate, one 
box containing both variants is drawn (B). Otherwise, a separate blue box is 
drawn around each matched variant. Since this function depends on correct 
recognition of matching forward and reverse name pairs, with some naming schemes 
for data files it may not work correctly. If you decide to annotate and export 
the variant, a light green rectangle is drawn to the left of the cell (C).  
 
  
Figure 13: Marking a sequence variant as 
"dismissed"  
When analysing a large number of sequence files, it is possible to forget 
which variants have been examined and what action was taken. To remedy this, the 
grid may be annotated as the underlying data are being viewed. The simplest 
annotation is to mark a variant as an artefact, by simply placing the cursor 
over the cell and pressing the ‘m’ key. This marks the cell with a dark 
green rectangle, indicating that it has been viewed and the sequence variant has 
been disregarded (Figure 13). 
 
  
Figure 14: Suppressing all instances of a selected 
variant  
If your sequence contains a common non-pathogenic SNP, or is subject to a 
sequence-specific artefact, it is possible to suppress all instances of the same 
sequence variant. To do this, place the cursor over a cell that exhibits the 
variant, and type ‘Ctrl-S’. Any variants at the same position in the 
reference file, that were sequenced in the same direction and exhibit the same 
sequence variant, will be redrawn in pink rather than red. 
 
  
Figure 15: Ctrl-R to manually reject an aligned 
sequence  
If the program managed to align a low quality sequence to the reference file, 
it is still possible to manually reject the file. To do this, either press 
‘Ctrl-S’ or left-click on the row and select . Manually rejected sequences are then identified 
by a thin grey line running along the top of the row. If you change your mind, 
repeating the action will “de-reject” the sequence and the grey line will be 
replaced by an orange one. 
 
  
Figure 16: Manual rejection of an exported indel  
Similarly, if you export a heterozygous indel, the sequence will be 
identified by a green line along the top of the row. If you change your mind, 
the heterozygous mutation can be deleted by left-clicking the row and selecting 
; the green line will then be replaced by an 
orange one. 
 
2.3. The ‘View’ window
  
Figure 17: Flanking sequence around a possible 
mutation  
The View window displays the sequence flanking the 
mutant nucleotide (Figure 17). The image includes three lines of text; the upper 
shows the trace sequence, the middle and lower show the reference nucleotide 
sequence and its translation (where applicable). Positions where the trace and 
reference sequences differ are highlighted by pink rectangles, and the blue box 
identifies the nucleotide under scrutiny. 
 
2.4. The ‘Edit’ form
  
Figure 18: Compare different trace files and edit 
base-calling  
The Edit form can be accessed either by clicking on 
a cell in the data grid or via the Edit button at the 
bottom right of the Results tab (Figure 9). The Edit form contains four regions (Figure 18). The lower three 
display trace file data, while the upper panel shows the relevant reference file 
genomic, exonic and protein sequences. The sequence displayed in each of the 
lower panels can be changed by selecting a different file name from the 
drop-down list above each panel. The format of each trace image is the same as 
that used in the View window described above. However, 
unlike the View window, it is possible to navigate 
along the sequence, using the scroll bar beneath the upper text panel. The 
nucleotide displayed in red text in the top panel corresponds to the position 
between the two red lines in the three trace image panels. 
It is also possible to edit the base calling for each trace file using this 
window. To make a visual assessment of the validity of a possible mutation, 
display the forward and reverse sequences from an individual in the top two 
trace panels and a reference sequence in the lowest panel. If it appears that 
the mutation is either a sequencing artefact or base-calling error, the sequence 
can be edited by placing the cursor over the aberrant base, clicking the left 
mouse button, and selecting an option from the floating menu (Figure 19). 
 
  
Figure 19: Editing the trace sequence  
To change or delete a nucleotide, place the cursor over that nucleotide. To 
insert a nucleotide, click on a position in the space between two residues and 
select the  option. This will insert an N in 
the sequence, which can then be reselected and edited to the desired residue. 
After editing a sequence, the underlying data are realigned once the edit form 
is closed, whereupon the mutation data grid will be revised. 
 
2.5. The ‘Annotate’ form
  
Figure 20: The single-nucleotide mutation annotation 
form  
Once the putative mutations displayed in the data grid have been checked, 
they can be exported, either to a text file, a web page or an import file for 
the LOVD mutation database. However, before this 
can be done, the mutation must be selected for export, and (optionally) 
annotated. 
To mark a single-nucleotide mutation for export, select the  option (Figure 9a), to display 
the form shown in Figure 20. This contains an image of the selected region, in 
the same format as in the previous two forms. The form also contains text boxes 
that allow annotation of the mutation. Those boxes describing the sequence 
variant itself are automatically filled by the program, but can be user-edited. 
Once the annotation process is complete, click Export 
to mark the sequence variant for export. (If you later change your mind, click 
Ignore.) A new sequence can be selected using the 
drop-down list-box above the image panel, while the displayed nucleotide 
position can be selected using the list-box beneath this. (Only positions that 
are linked to a mutation are selectable in this drop-down list.) Alternatively, 
you can cycle through the sequence variants by pressing the Previous or Next buttons. 
To annotate and export heterozygous indels, or complex mutations that affect 
more than one nucleotide, select  >  (Figure 9a); this 
displays the Annotate heterozygous indels 
form described in Section 3. 
 
2.6. Genotyping a SNP
  
Figure 21: Saving genotype data  
To demonstrate the  function, analyse 
the data in the BRCA2 SNP folder, using the 
reference file 2f to 2r start 9785 length 250.ref 
(also in the same folder). This folder contains 190 files, and should generate 
the data grid shown in Figure 21. When the data are viewed, a number of 
polymorphic nucleotide positions are identified. To analyse the genotypes of the 
files at nt 77, left-click a cell in the corresponding column, select  from the floating menu, and then save the data 
as a web page or text file. If you select “web page”, an additional folder will 
be created to contain the images needed by the web page. 
Figure 22 shows the bottom of the resulting web page, with the data from the 
last two trace files and a summary of the allele frequencies for the set of 
sequences. 
 
  
Figure 22: The bottom of a genotype web page  
When creating a genotype webpage, GeneScreen 
counts the numbers of heterozygous and homozygous sequences in the sequence 
trace folder. Since the scoring of some variants is adversely affected by 
differences in context-dependent dideoxynucleotide incorporation, the allele 
frequencies for forward and reverse sequences are shown separately. The 
rationale for this is illustrated by Figure 22, in which the forward sequence 
(upper image) displays a reduced peak height of the G allele compared to 
the A. In some situations, this may cause the program to disregard a 
heterozygous sequence variant (if the difference in peak heights is greater than 
the heterozygous cut-off limit). 
 
2.7. Analysis of rejected sequences
Sequence files can be rejected on a number of criteria: 
  - The sequence could not be aligned to the reference sequence.
  
  - The alignment did not span the entire region of interest.
   
  - The sequence contained more than a preselected number of sequence 
  variants. 
   
  - The user rejected the sequence manually. 
  
  
Figure 23: The ‘Failed sequences’ form  
The Failed sequences form allows you to list 
the failed sequences, screen them for heterozygous indels, or move/copy them 
to a different directory. 
Pressing the Failed button at the bottom left of 
the Results grid (Figure 9a) displays the 
Failed sequences form (Figure 23). This allows 
the user to perform certain actions on the rejected files. The files are grouped 
together, depending on the reason they were rejected. To select which group(s) 
to work with, tick any or all of the three boxes in the upper panel. (Note that 
files which failed by exceeding the permitted number of sequence variants 
(criterion 3 above) are included in the middle category (The alignment did 
not span the full sequence of interest). The task you wish to perform is 
then selected from the radio buttons below, and is executed by pressing OK. If the option to move or copy the failed files to the 
Failed folder is selected, this folder will be created 
within the trace file folder (and named Failed). Note 
that if you do move the files, then the program will no longer be able to copy 
the files or check for heterozygous indels (see below). 
 
2.7.1. Rejected sequences and heterozygous indels
  
Figure 24: Apparent sequence failure due to heterozygous 
indel  
Since trace files that contain heterozygous indels can be very difficult to 
base call and align to the reference sequence, they often appear as low quality 
sequences. This means that all failed sequences should be checked for 
heterozygous indels. This can be done either by using the Failed sequences form described immediately above, or by 
left-clicking on the row linked to the failed sequence and selecting  > . This will open the form shown in Figure 25. If 
you arrive at this window via the Failed sequences form, all the sequences will already 
be available via the list-box (below the “Select the trace file that contains 
the InDel” label). It must be noted that the program re-reads the data file when 
analysing the sequence for indels, and so it will fail if you have moved the 
apparently failed source files to a different folder during earlier analysis 
(see Analysis of 
rejected sequences above). Further information on indel analysis using this 
form follows in Section 3 below. 
 
3. Analysis of heterozygous indel mutations
Due to the difficulty of automatically distinguishing sequences containing 
heterozygous indels from those that happen to have short regions of high quality 
data followed by low quality data, trace files must currently be analysed for 
heterozygous indels one at a time. This limitation will be resolved in future 
versions of GeneScreen. 
  
a.   
b. 
Figure 25: Screening a sequence for heterozygous 
indels  
To screen a file for heterozygous indels or other complex mutations, use the 
form shown in Figure 25. This can be reached via the Failed sequences 
window, left-clicking on the grid (Figure 24), or directly 
from the Single file tab. 
First, pick the trace file for analysis using the drop-down list. Next, click 
Analyse, and use the horizontal scroll bar (beneath 
the trace image panel) to navigate along the sequence. The text box below the 
scroll bar shows the sequences from the trace file and reference file; ambiguous 
positions are resolved by subtracting the reference sequence from the trace 
sequence. (If the trace sequence is the reverse complement of the reference 
file, this box has a pale yellow background.) The program assumes that one 
allele matches the original reference sequence; this allele is shown shaded 
blue, while the deduced other allele (result of subtraction) is shaded 
green. 
In Figure 25a, a heterozygous 2-nt deletion within the sequence AGAG is 
shown. The first ambiguous nucleotide after the deletion is visible as a pink 
rectangle. The resolved sequences for the two alleles are displayed in the text 
box at the bottom right of the window and the mouse pointer is indicating the 
position at which the dinucleotide is deleted on one allele. As in other 
windows, the red nucleotide in the text box corresponds to the position between 
the red lines in the trace image panel. In Figure 25b, the red highlighting 
point has been moved to the point at which the deletion occurs, and the 
nomenclature for the deletion is displayed in the bottom text box. 
If the program fails to align and resolve a suspected indel, adjusting the 
heterozygous cutoff value (next to the Analyse button) 
may resolve the problem. 
 
  
a.   
b. 
Figure 26: Editing the base calling on the Indels 
tab  
Since it can be difficult to accurately base-call a sequence that follows a 
heterozygous indel, the program allows manual editing of the sequence in the 
trace panel, in the same way as in Figure 19. Figure 26 
illustrates the difference editing can make. In Figure 26a (before editing) the 
mouse arrow shows the position of a spurious extra Y nucleotide that has 
been incorrectly inserted due to the mobility shift between the C and 
T peaks. In Figure 26b the extra nucleotide has been manually deleted, 
with the result that the alignment of the deduced allelic sequences in the lower 
text box is resolved. The program can now correctly annotate this heterozygous 
mutation and display the aligned allelic sequences beyond the deletion 
point. 
 
  
Figure 27: The Indel annotation form.  
As before, to export this variant, click Export to 
display the annotation form (Figure 27). This form is very similar to the one 
shown in Figure 20 
and works in the same way, except that only one file can be viewed and it is not 
possible to navigate along the sequence. This form can only export the 
nucleotide highlighted in red on the Indels window. 
The action performed by clicking Export depends on how 
you arrived at the Indels window:  
– 
If via the Single file tab, the mutation is exported 
according to the options selected on that tab (Figure 30).  
– 
If via the Failed 
sequences window or from left-clicking on the grid (Figure 24), the mutation 
data are saved along with all other mutations, and can later be exported using 
the Export data tab (Section 4, Figure 28). 
 
 
4. Exporting sequence variants
  
Figure 28: The Export data tab.  
This tab allows you to export the sequence variants to either text files, web 
pages or the LOVD database (Figure 28). If no mutations have been marked for 
export, all buttons will be disabled apart from the first, labelled Select. Pressing Select will 
display the annotation form (Figure 20), enabling you 
to annotate the sequence variants. For help in situations in which a mutation is 
of a complex type that the program cannot resolve, there is also a hyperlink to 
the DNA / protein mutation nomenclature web site next to this button. 
The first data export option Database creates a 
tab-delimited text file that contains the annotation information as well as the 
sequence variant. (If you select an existing file to save to, the data will be 
appended to the end of that file.) This file can be viewed and edited in a plain 
text editor or in a spreadsheet program. 
The LOVD option saves the data to a file that can be 
used as a template to create a database import file for the Leiden Open Variation Database, LOVD. Since there is 
no standard field format for this database, it is not possible to create a file 
that can be directly imported into LOVD. However the file is very similar to 
that needed by a standard installation of LOVD. 
The HTML page option creates a web page that 
contains all the variants marked for export, and orders them by gene name, 
nucleotide position in the reference file and patient name. If an existing file 
is selected, the original file is read and its data combined with the current 
data to produce a new file with all the sequence varants in the correct order. 
This allows the web page to be used as static web page and as a dynamic 
database. 
  
Figure 29: The Edit web page tab.  
A mutations web page can be edited by using the Edit web page tab. First load the desired web page 
using the Select button. The program will then read 
the web page and present the mutation data one row at a time, as shown in Figure 
29. Use the buttons labelled < or > to move through the rows. The current position of the 
mutation within the web page is indicated by the progress bar next to the < button. All the data fields can be edited, and the 
corrections saved by pressing the Update button. 
However, note that if you move to a different mutation without pressing Update, any changes will be lost. To completely remove a 
mutation from the edited webpage, press the Exclude 
button. Such mutations will then be indicated by a red Excluded label next to the Exclude button. An excluded mutation will not be deleted 
from the current data set, but will not be saved to the edited web page file. To 
reverse the exclusion of an excluded mutation, simply click the Exclude button again. Finally, to save the edited web page 
press the Save button, which will create a new *.htm 
file with “edited” appended to the original filename. (If a file with the 
resulting name already exists, it will be overwritten.) 
 
5. Analysing single files
  
Figure 30: The Single file tab  
For quick analysis of a single ABI sequence trace, without needing to work 
with the grid interface, the Single file tab may be 
useful. First select the reference file and test (.AB1) file and then press 
Analyse. This will open the Indels window which can be used to analyse the sequence as 
described in Section 
3. Since data analysis initiated via this tab is separate from that 
performed through the Sequences and Results tabs, different data may be analysed simultaneously 
through the Single file tab. However, if this is 
done, to export a mutation you must use the Export options on the Single file tab and not those on the Export data tab.  
                 |