About matching first and last part of the string

Question

ghosh22 0 Junior Poster in Training

14 Years Ago

Hi everybody!! I think this is a simplest perl related problem..but still I need your help.
Here's my sample input file:

>blast
ATGGGCCTAC
ATCCACSTAT

Please note that the number of lines could be more than these two, but the Perl script should skip the first line which starts with '>'.
Now the Perl script should take multiple lines as a single line and check if the line starts with ATG and ends with TAT. If this condition is true, then the output should be "gene". Else "not gene".
But my perl script is not taking the whole file. It is taking one line at a time. Here's my script:

#!usr/bin/perl 

print "Print your file name with location\n";
$dnafile=<STDIN>;
chomp $dnafile;

open (DNA, $dnafile) || die "Cannot open the file : $!";

while ($dna=<DNA>)
{
	chomp ($dna);
	### Check Starting not equals to '>' letter
	if ($dna=~/^[^>]/)
	{
		@dna=split ('', $dna);
		
           
		print "$dna";
               
   
      
	}
     
  if (($dna=~/^ATG/) && ($dna=~/TAT$/)) {
      print "gene";
 }
  else {
  print "Not gene\n";
 }
       
}

Please let me know how can I improve it?
Thanks

perl

3 Contributors
8 Replies
123 Views
1 Day Discussion Span
Latest Post 14 Years Ago Latest Post by ghosh22

All 8 Replies

k_manimuthu 43 Junior Poster in Training

14 Years Ago

But my perl script is not taking the whole file.

undef $/;  # input record Separator 
open (FILEHANDLE, "$input_file") || die "Cannot Open the $input_file : $!";
my $file_content = <FILEHANDLE>;
close (FILEHANDLE);
print $file_content;

or

open (FILEHANDLE, "$input_file") || die "Cannot Open the $input_file : $!";
read FILEHANDLE, my $file_content, -s FILEHANDLE;
close (FILEHANDLE);
print $file_content;

k_manimuthu 43 Junior Poster in Training

14 Years Ago

Hi Ghosh,

Read the below links and try the updated code.

File Handling
File Contents
Regular Expression
and more

open (FIN, "$input_file") || die "Cannot Open the $input_file : $!";
read FIN, my $file, -s FIN;
close (FIN);

if ($file =~ m{
		^	 # Match Begining
		>	 # match '>' char
		[^\n]+\n  # Caputred the first line
		ATG.*TAT  # Match char 'ATG' followed any characters and 'TAT'
		$	 # Match End 
	       }xs)
{
	print "\nGene";
}
else
{
	print "\nNot Gene";
}

Edited 14 Years Ago by k_manimuthu because: update

Reply to this topic

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.

ghosh22 0 Junior Poster in Training · Answer 1 · 2010-12-14T18:20:59+00:00

hii thanks..so u mean that I have to add this piece of code before the while loop?

ghosh22 0 Junior Poster in Training · Answer 2 · 2010-12-15T00:48:52+00:00

ghosh22 0 Junior Poster in Training

14 Years Ago

ok

d5e5 109 Master Poster · Answer 3 · 2010-12-15T03:07:44+00:00

k_manimuthu's answers should work fine. Here is a slightly different way to do the same thing.

#!/usr/bin/perl
use strict;
use warnings;

my $input_file = 'blast.txt';

open my $fh, '<', $input_file or die "Cannot Open the $input_file : $!";

my $sequence;
while (<$fh>){
    chomp;
    $sequence .= $_ unless m/^>/;#Skip the line that starts with >
}

print $sequence, "\n";

if ($sequence =~ /^ATG.*TAT$/){
    print "The above sequence starts with ATG and ends with TAT, so it's a gene.";
}
else{
    print "The above sequence is not a gene.";
}
close $fh;

This gives the following output:

ATGGGCCTACATCCACSTAT
The above sequence starts with ATG and ends with TAT, so it's a gene.

ghosh22 0 Junior Poster in Training · Answer 4 · 2010-12-15T12:10:25+00:00

hi thanks..would u plz let me know the meaning of .= in line 12?
thanks

k_manimuthu 43 Junior Poster in Training · Answer 5 · 2010-12-15T14:26:52+00:00

it is a concat statment. It means....

$sequence = $sequence . $_;

## Another one example
$first_name = 'Mani';
$last_name  = 'Muthu';
$full_name  = "$first_name". " " . "$last_name";
print $full_name;

ghosh22 0 Junior Poster in Training · Answer 6 · 2010-12-15T20:34:32+00:00

ghosh22 0 Junior Poster in Training

14 Years Ago

oh..gr8!! thnks..

About matching first and last part of the string

Recommended Answers Collapse Answers

All 8 Replies

Recommended Answers