changelog

Revision history of G-language System

v.1.8.x

Optimization work in progress list

v.1.9.2 in development

added .gbff extension to be interpreted as GenBank format
added protein_id shortcut in the G::IO structure
updated TogoWS URL

v.1.9.1 2016.01.04

bug fix in base_counter
fixed a serious bug in the previous revision, where the default codon table was set to be number 25 instead of 1

v.1.9.0 2014.11.14

added -amino option to to_fasta()
added -mean option to cumulative()
added -output=>"f" to nucleotide_periodicity()
added -table option for Alternative Codon Tables for translate()
added w_tai() and -tai option to G::Seq::Codon::cai()
added G::Seq::GCskew::coding_density()
added G::Seq::Codon::Dmean()
added G::Seq::Util::refseq2gbk()
added $gb->annotate(), a wrapper around Restauro-G v.1.5
added Rcmd::Summary::summary(), a smart statistical summary interface for given data or data set
added pca, hclust, som, kmeans (subset of Rcmd but exported by default through G) for statistical analysis
added G::Tools::EMBOSS::emboss(), which can run both EMBOSS and KBWS applications. seqret() is now deprecated.
added an option to insert desired number of 'N's between contigs in new G() and load() with 'multiple locus' option.
switched file conversion process from BioPerl to EMBOSS REST Service. Now the output formats supported are dependent on EMBOSS
http://emboss.sourceforge.net/docs/themes/SequenceFormats.html#out, but we now have support for GFF, GFF2, GFF3 output from $gb->output()
load() can now handle name of the species, its abbreviations, and NCBI taxonomy IDs for bacteria. For example, all of the following examples load E.coli K12 genome (NC_000913).
- $gb = load("Escherichia coli");
- $gb = load("e.coli k12");
- $gb = load("511145");
$gb->seq_info() now shows number of CDSs (when annotation is present) and number of LOCUSes (with multiple locus option)
updated G::Seq::Util::molecular_weight() to handle ssDNA, dsDNA, peptide, RNA, and chemical formula
bug fix for -tag option in w_value()
bug fix in G::Seq::PatSearch::signature_dist()
bug fix in G::Seq::Codon for REST services compatibility
bug fix for reading multi-line, multiple feature tags (ex. multiple long GO_function feature tag within a CDS)
bug fix for G::IO::Handler::_interpret_format() for embl extension.
many minor fixes for use from REST/SOAP services, like default file names
Removed G::Tools::Alignment (clustalw(), _fasta(), _blast(), _formatdb(), blastall(), blat(), which have corresponding KBWS tools)
Removed G::Seq::AminoAcid::monoisotopic() which was overlapping with residue()
Removed G::Seq::AminoAcid::peptide_mass() and it is now merged to G::Seq::Util::molecular_weight()
Removed G::Tools::DotE
Removed G::Tools::EMBOSS (merged to G::Tools::WebServices)
Removed requirements of SOAP::Lite, Bio::Perl, Graph::Layout::Aesthetic at installation
changed default behavior of G::Tools::Statistics::cor() to return Pearson linear correlation instead of R^2

v.1.8.13 2011.04.29

one2three and three2one are now exported (with amino acid sequence support)
bug fix for GenBank output of transl_except feature (lack of brackets)
bug fix of G-language Shell that did not unlink tmp files
major revision to G::Seq::GCskew::rep_ori_ter(). Now support DoriC and dif sequence databases, Oriloc (SeqinR) prediction, and -gcskew option.
minor documentation update for G::Seq::Codon::codon_compiler()
updates in signature() and signature_dist
added G::Seq::GCskew::geneskew()
added G::Seq::GCskew::lda_bias()
added G::Seq::GCskew::b1()
added G::Seq::GCskew::b2()
added G::Seq::GCskew::delta_gcskew()
added G::IO::Handler::set_generationtime()
added G::Seq::Codon::S()
added G::Seq::Codon::delta_enc()
added support for codon_start key for $gb->get_geneseq()
added support for multiple sequences to to_fasta()
added support for commands to readFile()

v.1.8.12 2010.08.27

corrected taxonomy for shortened description of bacilli, streptococcus, and synechococcus. (patch by haruo)
added KEGG/SwissProt parser to readFile();
bug fix for outputting partial gene locus info for complementary genes in GenBank format.
bug fix for set_operon() due to format changes.
MOBY::Client::Central used for ws() is now optional module.
IPC::Shareable used for Infinity job distribution is now optional.
fixes for G::Seq::Codon.
support for linear chromosomes in circular_map() and introns and exons in genome_map3()

v.1.8.11 2010.02.05

Bug fix in G::Seq::Util::longest_ORF
Bug fix in handling multiple instances of FASTA/FASTQ data for multiple locus datasets.
Bug fix in handling single letter exon.
added support for gzipped files. load() can read from gzipped database flatfiles, and readFile() can also handle gzipped files.
added G::Seq::Util::generate_oligomers()
added documentations for non-exported methods of G::Seq::AminoAcid
added documentations for G::Seq::Consensus
seqinfo() now calls amino_info() when the sequence is amino acid.
help in G-language Shell is now case insensitive, and can look for Prelude methods by their method names.
? command is introduced to G-language Shell (equivalent to "help")
changed sequence retrieval from BioPerl to togoWS (about x5~x10 faster)
opt_as_gb() now accepts filenames and accession numbers. For example, the following now works:

 gcskew('NC_000913');
 gcskew('somewhere/hoge.gbk');
 gcskew('embl:U00096');

taken out g2s from the main distribution.
updated the copyright date to 2010. Happy new year to all:)
removed G::Seq::FreeEnergy (to be implemented in KBWS), G::Seq::ImaGene (much better to simply use BioConductor in R), G::Seq::Markov (moved to G::Seq::PatSearch), G::Seq::PathwayAlignment (not used), G::Seq::COMGA (system classes removed), G::Tools::GlimmerM (not used, replacement with annotation with GFF is considered), G::Tools::KEGGAPI (keggapi() is moved to rest.g-language.org/keggapi/, and its interface is available in G::Tools::WebServices), G::Inspire (deprecated System methods), G::Tools::PEC (moved to G::IO::Handler)
removed G::Seq::Tandem::graphical_LTR_search (almost identical to seq2png)
removed deprecated "long sequence" and "without annotation" options of load()

v.1.8.10 2009.12.04

history is now persistent in G-language Shell
codon_usage now accepts tRNAscan-SE output
added support for contig: feature location in GenBank
added G::Tools::Alignment::blat (experimental)
added syntax-based file format interpretation to G::IO::Handler::_interpret_format()
added support for FastQ files (G::IO::FastQI). quality is stored in $gb->{QUAL}
added bbur (NC_001318) and plasmidF (NC_002483) to bundled genomes
added G::Seq::PathSearch::signature_dist
added G::Seq::Codon::scs
added FASTA/FASTQ support in readFile(), returning sequence as hash.
added Selenocysteine and Xaa (Any amino acid) to G::Seq::AminoAcid::one2three
added G::Seq::AminoAcid::hydropathy
added support for custom codon table to G::Seq::Primitive::translate
optimization of G::Seq::Codon to remove redundancy (also w/ G::Seq::AminoAcid)
fixed handling of circular/linear LOCUS type
fixed a bug in $gb->rRNA()
fixed a bug in loading number only feature value in GenBank.
fixed a bug in handling join feature without feature attribute immediately before the sequence
fixed a bug that adds blank line when encountering very long feature entry in G::IO::GenBankO
deprecated the following methods:

 codon_counter
 amino_counter

deleted the following methods:

 codon_amino_counter
 _codon_amino_printer
 _codon_usage_printer

v.1.8.9 2009.08.16

changed set_operon() to support DOOR database instead of ODB.
optimized G::Seq::Align
optimized G::Seq::AminoAcid
added msg_datafile()
added opt_inputType()
added URL support for readFile. Now you can do

print readFile('http://togows.dbcls.jp/entry/pubmed/12538262')

bug fix around $gb->output() for format other than GenBank.
support for taxonomy annotation for GenBank files (header) (contributed by Haruo Suzuki)
added -slide option to gcskew()
$gb->rRNA() can now specify the subunit type. (contributed by Haruo Suzuki)
updated generateGMap (new map controller, xy2latlng function, apikey option)
updates SubOpt to handle graph and data directories on a timely fashion
added G::Seq::PatSearch::kmer_table() and cgr()

v.1.8.8 2009.03.14

fixed genome_map2() and plasmid_map() SVG generation.
added readFile() and writeFile(). see help for details.
updated $gb->intergenic() to include stable RNA genes as genetic elements, and modified genomicskew() accordingly.
removed all G::System:: classes and gcf files. we are now creating new set of user interfaces, which will replace these functions.
removed all G::SystemsBiology:: classes related to E-Cell 1. moved G::SystemsBiology::PathwayAlignment to G::Seq::PathwayAlignment, G::SystemsBiology::DotE to G::Tools::DotE. G::SystemsBiology is deprecated.
G::Tools::Literature (pubmed search is available through Shell or togoWS), G::Tools::HMMER (web service wrapper is on the way) is deprecated.
CASYS-related methods (many of G::Tools::Blast, G::Tools::Mapping, G::Tools::Cap3, G::Tools::Sim4, G::Tools::Repeat) is deprecated.
G::Tools::Fasta and G::Tools::Blast is merged with G::Tools::Alignment. Some functions relevant to G::Seq::PathwayAlignment were moved accordingly.
stopped EXPORTing many methods starting with an underscore (_) that are only used within the modules.
added method_list() that returns an array of available G-language GAE functions.

v.1.8.7 2009.03.13

added -print=>1 option to $gb->find()
fixed bug of calc_pI(), peptide_mass(), nucleotide_periodicity(), palindrome(), codon_usage()
changed the deafult behavior to return fasta as a string as well as outputting to a file
updated set_operon() to match RegulonDB 6.0. added $genome->{$cds}->{operonEvidence} for E.coli. Moreover, about 200 bacterial genomes are now supported through Operon DataBase (ODB).
implemented togoWS(), interface to togoWS.

v.1.8.6 2009.03.11

fixed a bug in $gb->before_startcodon, after_startcodon, before_stopcodon, after_stopcodon, where the functions returned wrong sequence when the requested length was not available at sequence ends.
fixed a bug for handling -type option in G::Seq::PatSearch and find_dif()
added G::Tools::Statistics::cumulative()
fixed a bug in $gb->stopcodon() which was not working properly, probably since 1.8.4
modified $gb->pos2gene() and $gb->pos2feature() to accept two positions to obtain features existing within these positions.
updated gcsi() to version 2
added UNIX piping for G-language Shell. For example, now you can do the following:

 $gb->find() |grep replication |head -n 5

v.1.8.5 2008.11.01

fixed a bug of seqinfo() in handling capitalized sequence data.
fixed $gb->output() bug in handling '%'
fixed multi-fasta related bug in G::IO::FastaI, and optimized
fixed gcf file permission
fixed a bug in GenBank parser. WARNING: previous versions may not have correctly parsed "join" definitions that span for multiple lines.

v.1.8.4 2008.05.16

added "rep_origin" feature support for rep_ori_ter()
refactored $gb->gene2id(), $gb->seq_info() and G::Seq::Util::seqinfo()
refactoring of G::IO::Handler
added $gb->disable_pseudogenes
fixed a bug in new G() with "multiple locus" option, which probably was not working since 1.8.1.
minor update for generateGMap()
fixed a bug in G::DynamicLoader, which probably was not working for a long time...
MacOS X release now includes EMBOSS, blastall, blat, clustalw2 preinstalled.

v.1.8.3 2008.04.21

removed deprecated G::Seq::Usage, which was previously integrated to G::Seq::Codon
added error handling for G::Seq::GCskew::gcsi for when genome size is too small
$gb->find() is now case insensitive
bug fix for arrow direction for G::Seq::GenomeMap::circular_map()
added opt_list which shows the default options for Odyssay functions. This is in alpha state.
major update to G::Seq::Primitive::shuffleseq() to support preservation of k-mer count.
added $gb->reverse_strand(), $gb->relocate_origin()
genomicskew() now returns array of references to result arrays
added -at, -purine and -keto options for gcsi()
updated bundled genomes

v.1.8.2 2008.02.27

now requires MOBY::Client::Central
added help -w for searching web services through BioMOBY.
added G::Tools::WebServices::ws for running BioMOBY web services.
this is mainly the result ofBioHackathon 2008. Philosophy behind this implementation is documented here.

v.1.8.1 2008.02.14

mionor fix for dnawalk, genome_map3, circular_map for Genome Projector
G::Tools::GMap::generateGMap() can now handle levels greater than 6
HTML::Form now required for installation
added G::Tools::EMBOSS, with seqret() , and G::Tools::Blast::blastall() by Kazuki Oshita
added seqret.pl which is a simple script that use seqret() which functions identically as EMBOSS seqret command.
added blastall.pl which is a simple script that use blastall() which functions identically as blastall commandline tool, but with DDBJ REST web service (so not formatdb required!).
help command now stores documentation in virtual memory, and does not build everytime.
help shows documentation when there is only one possible entry
G::IO now keeps the pointer to the last instance created by new G(), which can be accessed by lastInstance G() or lastInstance G::IO(). SubOpt::opt_as_gb() now returns this lastInstance when there is no argument specified, so you can now do analysis in G-language Shell without making a variable for the G instance, as follows:

 G > load ecoli
 G > gcskew
 [ecoli graph comes up]
 G > load bsub
 G > gcskew
 [bsub graph comes up]

v.1.8.0 2008.01.09

added -e option to G command (Shell). This is experimental.
fixed loading bug (for .glang directory) in glang command
fixed a bug in db_save
This release is for KNOB 4.0 only.

v.1.7.x

v.1.7.8 2007.11.15

introduced "load()" method, which is equivalent to "new G()", so something like the following is now preferred.

                $gb = load "ecoli"

removed G::Tools::Graph::_UniUniGrapher
deprecated G::Tools::Graph::_UniMultiGrapher. Use G::Tools::Graph::grapher instead.
always make data and graph directories when opt_get() is called
added G::Seq::PatSearch::find_iteron() and find_pattern()
in Shell, $gb is preloaded with "ecoli" data if $gb is not saved in workspace.
added -length option for G::Seq::PatSearch::oligomer_counter()
added G::Seq::PatSearch::signature() and updated G::Seq::Codon for documentation and for addition of Ew, P2, codon_mva, deletion of entropy_cu, entropy_scu, codon_pca. (thanks to Haruo Suzuki of Idaho Univ.!)

v.1.7.7 2007.11.05

fixed bugs in find_dif(), find_ter(), find_dnaAbox() introduced or upgraded in 1.7.6.
removed G::Tools::KEGG_API3 (merged with G::Tools::KEGG_API)
lazy loading of external modules to achieve x2~x3 faster loading using x2-x3 less memory.
first integration of Infinity (distributed computing) modules.
code optimization. Now entire packages is less than 30,000 lines

v.1.7.6 2007.10.23

fixed a bug in 1.7.5 installer
removed G::Seq::PatSearch::find_seq()
added G::Seq::PatSearch::oligomer_search(), and updated other functions in this class to use this function. Now oligomer_counter() can search oligomers using degenerate nucleotide alphabet or regular expressions. See the help for these functions for details.
added find_ter(), upgraded find_dnaAbox(), find_dif() in G::Seq::PatSearch
added scalar context return value for leading_strand()
greatly improved the performance of G::Seq::GCskew::find_ori_ter()
fixed SVG-related bugs in genome_map2() and plasmid_map()
added circular_map(), a very good-looking SVG image generator by Nobuhiro Kido, used in GenomeProjector.
added dnawalk(), by Keita Ikegami, used in GenomeProjector.

v.1.7.5 2007.09.26

added $gb->around_startcodon() and $gb->around_stopcodon()
removed G::Seq::Eliminate::eliminate_pat(). try the following instead:

foreach $cds ($gb->cds()){
     next if ($gb->around_startcodon($cds, 50, 50) =~ /pattern/);
 }

removed G::Seq::Eliminate::valid_CDS(). try the following instead:

 foreach $cds ($gb->cds()){
      my $genelength = length($gb->get_geneseq($cds))
      next if ($genelength > 10000 || $genelength < 20);
  }

moved G::Seq::Eliminate::eliminate_atg to G::Seq::Util::filter_cds_by_atg. G::Seq::Eliminate is deprecated.
removed maskseq, pasteseq, cds_echo, print_gene_function_list, atgcon from G::Seq::Util
removed find_identical_gene, pseudo_atg from G::Seq::ORF
fixed new G("file", "longest orf annotation");
moved and rewritten longest_ORF to G::Seq::Util. G::Seq::ORF is now removed.
G::Tools::(PBS|H2v|EPCR) are now deprecated.
G::System::STeP is now deprecated.

v.1.7.4 2007.09.09

G::Prelude is renamed to G::IO::Handler
G::Skyline is renamed to G::IO
moved G::IO::Bioperl::convert to G::IO::_bp2gb. removed G::IO::Bioperl.
bug with BioPerl instances fixed.
G::DB::BDBI is renamed to G::DB::Handler
G::DB::Boranch is merged with G::DB::SDB and is removed.
minor modifications to G::DB::BDB

v.1.7.3 2007.09.08

p, say, puts are moved to G::Messenger (from G::Shell::Log) and are exported by default.
fixed line-break bug for $gb->output() (in G::IO::GenBankO) introduced by the 1.7.2 update.
made $gb->cds() depend on $gb->feature()
G::DB::BDB (codename: Bluebird and Orochi) implemented. Something like following works now:)

 $db = db_load("gene",  
                     -driver=>"mysql", 
                     -database=>"mus_musculus_core_46_36g", 
                     -host=>"ensembldb.ensembl.org", 
                     -port=>3306,  
                     -primarykey=>"gene_id" 
                     ); 
 say $db->{239967}->{status};

exported methods are:

             db_dbi
             db_exists
             db_path
             db_set_path
             db_save
             db_load

most work just like sdb in G::DB::SDB.

G::DB::GDBI, G::DB::GDBAPI (previous versions of Orochi) are removed.

v.1.7.2 2007.08.30

now requires DBD::SQLite.
fixed importing bugs in G::Seq::GCSkew.
fixed File::ShareDir related bugs in glang and g2s
fixed set_operon() to match the current RegulonDB format. (patch by Hiroyuki Nakamura. Thanx:)
fixed line-break problem for $gb->output() (in G::IO::GenBankO)
added -circular option to $gb->getseq(), $gb->get_gbkseq() (see help command for details)
$gb->get_cdsseq() correctly handles joined CDS entry which spans across the ends of circular chromosome
$gb->{CDS$n} is no longer separate: it is now an alias (reference or pointer) to corresponding $gb->{FEATURE$i}.
using the above, $gb->{locus_tag name} and $gb->{gene name} now works! For example, in E.coli, all of the following can access the same data.

 $gb->{FEATURE4}->{translation}
 $gb->{CDS2}->{translation}
 $gb->{thrA}->{translation}
 $gb->{b0002}->{translation}

fixes in G::Prelude to take advantage of the above. For example, the following now works:

 $gb->next_feature("thrA");
 $gb->get_geneseq("thrA");
 $gb->startcodon("thrA");
 $gb->before_startcodon("thrA");

added $gb->find(). This method allows to search within the genome database instance $gb. See help for details.

v.1.7.1 2007.07.20

Perl 5.8 is required now.
fixed persistant workspace bug for G-language Shell.

v.1.7.0 2007.07.10

added sample genomes
merged G-language Manager and g2s.pl to the package

v.1.6.x

v.1.6.13 2007.07.07

enabled searching for all modules with help -s|-g|-b without any keyword
G::Tools::GMap::generateGMap stopped tiling in the x-direction by default
moved view_cds from G::Seq::GCskew to G::Seq::Util
fixed the appearance of help -g
tidying up of the codes of G::Seq::Util
added -cumulative option for gcskew(), and cum_gcskew() is deprecated.
added gcsi()
added -filter option to find_ori_ter()

v.1.6.12 2007.06.23

added clear_cache command to G-language Shell
Math::FFT is now required.
removed the following deprecated modules

         G::SystemsBiology::Serizawa
         G::SystemsBiology::EcellReader
         G::SystemsBiology::Pathway
         G::Tools::RCluster (this is moved to Rcmd::Clustering

fixed line length bug in Rcmd.pm
added G::Tools::GMap for generating Google Map View
moved all methods related to GenomeMap to G::Seq::GenomeMap class
added genome_map3

v.1.6.11 2007.06.13

added documentation to G::Seq::Primitive
added shuffleseq method
made the "did you mean?" prediction of "help" smarter.
help -s now showing abstract of the searched methods updated documentations to match this.
added Ruby-style "p" for G-language Shell which prints the formatted data structures. "puts" also works in Ruby-style, and Perl6 "say" is a nice replacement of print, which prints lists separated by comma and also ending with a newline.
added Rcmd::Clutering: $rcmd->kmeans(), $rcmd->som(), $rcmd->hclust() added. see help document for details.
added Rcmd::Normality. $rcmd->normtest performs test of normality using Anderson-Darling, Kolmogorov-Smirnov Lilliefors, or Shapiro-Wilk method.
modified Rcmd to have $rcmd->set_mode() for temporary usage. Rcmd now inherits Rcmd::Clustering and Rcmd::Normality

v.1.6.10 2007.06.10

bug fixes of G-language Shell.
packaging for MacOS X 10.4 (Intel)

v.1.6.9 2007.06.07

added G::Tools::Statistics. exported methods are:

                 mean
                 sum
                 variance
                 standard_deviation
                 min
                 mindex
                 max
                 maxdex
                 median
                 least_squares_fit
                 cor
                 ttest

where cor include options for Spearman's, Pearson's, and Kendall's methods, and ttest supports both independent and paired Student's t-test.

v.1.6.8 2007.06.06

cleaned up the codes of G-language Shell
logging features are moved to G::Shell::Log logging now works quite nicely
modified the loading message
'-h' option shows help message for G command
removed "without annotation" option for new G()
removed "long sequence" option for new G()
Fixed a bug in EMBL parser (multiple-locus, join)

v.1.6.7 2007.06.05

added $gb->rRNA(), $gb->tRNA(), $gb->gene() or $gb->feature("rRNA"), $gb->feature("tRNA"), $gb->feature("gene") or $gb->feature("any feature type") which functions like $gb->cds()
added $gb->previous_feature(), $gb->next_feature(), $gb->previous_cds(), $gb->next_cds() which returns the feature ID of the next or previous features or CDS. See help for details.
added manual for 35 "prelude" methods
removed G::G.pm

v.1.6.6 2007.06.01

Added G::Shell::EUtils
This module provides pubmed and entrez commands to search through NCBI Entrez from G-language Shell use the help function of G-language Shell for details.

v.1.6.5 2007.05.31

Added help command to G-language Shell
Added G::Shell::Help to do the above purpose
help also searches for Bioperl functions:smiley:

v.1.6.4 2007.05.29

First package using Module::Install
SIGINT (Control-C) trap in Shell
added many shell commands for Shell
caching of genome data, which speeds up the system by 5-10 fold
use "no cache" option to use no caching, "force cache" option to rebuild cache in new G() when necessary.
added the following methods

 	query_arm
	dist_in_cc
	set_strand
	genes_from_ori

v.1.6.3 2007.01.12

G::Seq::, G::SystemsBiology::, G::Tools:: classes now use SelfLoader. This resulted in 20% speed-up.

v.1.6.2 2006.05.13

porting to MacOS X

v.1.6.1 2006.04.11

porting to KNOB (Knoppix for Bio)

< v.1.6

v.1.4.9 2004.11.14

porting to Windows (final version for Windows) We are no longer supporting Windows
Included G-language Shell (Interpreter)

v.1.1.0 2002.07.31

micro module core

The former 'Prelude' core has been subdivided into micro module cores each responsible for specific functions. This enables more Object Oriented style of architecture, and better flexibility in plugging. Another advantage is to use Skyline core functions without doing 'use G;'. This can be alternatively called internally as 'use G::Skyline'.

G::Skyline inherits G::IO::GenBankO, which inherits G::IO::GenBankI, which inherits G::Prelude.

The following is the explanation of [new|altered] modules:

  G.pm              Inherits G::Skyline. Now only responsible for 'new G()' options and output options
  G::Prelude        Core of core. Base class with manipulation methods only.
  G::IO::GenBankI   Inherits G::Prelude. Embedded with GenBank parsers.
  G::IO::GenBankO   Inherits G::GenBankI. Responsible for GenBank flatfile output.     
  G::IO::Bioperl    Responsible for the conversion of Bio::Seq::RichSeq to G::Skyline
  G::IO::Annotation Embedded with annotation functions
  G::Skyline        Dummy class that multiply inherit the above classes

complement() and translate() are moved to G::Seq::Primitive so that they can be 'use'-ed internally from odyssey modules.

As stated above, with stronger object oriented form, it is easier to expand the functions without risking new bugs, and G::IO:: can be plugged with new parsers, such as that for FASTA, EMBL, and so on for more speed.

enhanced bioperl porting Conversion of Bio::Seq::RichSeq object to G::Skyline object is further enhanced, now almost perfectly mirroring including the header and the 'join' field.

 G::IO::Bioperl::convert(struct Bio::Seq::RichSeq, struct G::Skyline);

easily converts the bioperl object.

SubOpt protocol is also enhanced so that now it can take bioperl object as well as G object. Therefore, all odyssey functions can be directly accessed from bioperl as follows:

         $in = new Bio::SeqIO('-file'=>"hoge.embl");
         $bioperl = $in->next_seq();
         gcskew($bioperl, -window=>1000, -output=>"show");

new interpreter for loading options

Multiple options can now be used. For example,

   $gb = new G("hoge.embl", "EMBL", "no msg", "without annotation", "longest ORF annotation", "multiple locus");

is now possible. Order can be random.

Database formats supported is also expanded, now supporting all of the following: GenBank, Fasta, SCF, PIR, EMBL, raw, GCG, ace, BSML, swiss, phd, game, qual. Moreover, file format option is now case insensitive. 'GENBANK', 'GenBank', 'genbank' all point to the same old GenBank. Furthermore, Skyline core can now automatically guess the format of the database even without the format options. Thus 'new G("hoge.gbk")' load genbank file, and 'new G("hoge.bsml")' automatically loads in BSML format. From now on, you probably do no have to care about the database formats at the input to G-language GAE.

Network retrieval is also automatically interpreted, and enhanced. Accession numbers starting with NC_ is taken from RefSeq in NCBI, and other accession numbers are taken from ordinary GenBank ftps. Again, here you do not have to enter 'net GenBank' option, although if you prefer that, you can.

BSML and LabBook Genome XML Viewer (TM)

Because BSML output is enabled, annotated G object can directly be put into the LabBook Genome XML Viewer (TM) which is a free genome viewer and editer of BSML files. This enables graphical genome view, circular plasmid view, with zooming and refering functions. Also, it is worth noting that it is the first step of G-language GAE to be able to input/output XML formats of databases. GAME XML format is also supported.

Bug fix and addition of modules

Bug in Prelude ($gb->del_key(), parsing of FT w/o value, -w switch compliancy) is fixed, and several new modules are incorporated into the Odyssey functions layer. The documentations for the new functions will be available at our website by the official release of version 2.

v.1.0.0 2001.09.11

initial relase

Related Projects

Contact

Kazuharu Arakawa, Ph.D.

G-language Project Leader Associate Professor

Institute for Advanced Biosciences Keio University

997-0017 Japan Tel/Fax: +81-235-29-0800 gaou@sfc.keio.ac.jp