
Class iceberg.transcoder.TranscoderService

java.lang.Object
   |
   +----ninja.rmi.NinjaRemoteObject
           |
           +----ninja.ispace.iSpaceService
                   |
                   +----iceberg.transcoder.TranscoderService

public class TranscoderService
extends iSpaceService
implements TranscoderServiceIF
Class that provides various transcodings of audio and text:

   audio -> text summary
   audio -> text outline
   audio -> audio summary
   audio -> skimmed audio
   audio file -> key word text file
   audio -> text
   audio file -> text file
   text -> audio
   text file -> audio file


Constructor Index

 o TranscoderService()
constructor

Method Index

 o audioAnalyzer(String, String, String)
 o audioSummary(byte[], int)
Makes an audio summary from voicemail.
 o destroy()
 o getNewFilename()
 o init(iSpaceServiceConfig)
Initialization: cleanup
 o main(String[])
Main program that recognizes the test.pcm audio file. To play the file: play -t .raw -r 22050 -w -s test.pcm
 o readFromStream(InputStream)
 o readFromStream(String)
Reads a file into a byte array.
 o skimmedAudio(byte[])
Method which generates skimmed audio from speech audio
 o speechRecoCommand(String)
Method which uses a rule grammar (NLP) for spotting key phrases in a speech audio file
 o speechRecognizer(byte[])
Method which transcodes speech to text
 o speechRecognizer(String)
Method which uses a dictation grammar for translating speech to text
 o speechRecognizer(String, String)
Method which uses a dictation grammar for translating speech to text
 o speechSynthesizer(byte[], int)
Method which synthesizes speech from text
 o speechSynthesizer(String)
Method which synthesizes speech from text
 o speechSynthesizer(String, String)
Method which synthesizes speech from text
 o textOutline(Outline, byte[])
Method creates a text outline from audio.
 o textSummary(byte[])
Makes a text summary from voicemail. This consists of two parts: a) recognize name and phone number; b) recognize key words using pitch detection.
 o toGSM(String)
 o toRaw22(String)
 o toWave22(String)
 o writeToFile(byte[], String)
Writes a byte array to a file.

Constructors

 o TranscoderService
 public TranscoderService() throws RemoteException
constructor

Methods

 o init
 protected void init(iSpaceServiceConfig config)
Initialization: cleanup

Overrides:
init in class iSpaceService
 o destroy
 protected void destroy()
Overrides:
destroy in class iSpaceService
 o readFromStream
 protected byte[] readFromStream(String filename) throws RemoteException, IOException
Reads a file into a byte array.

Parameters:
filename - the file to be read
Returns:
byte array of file contents
 o getNewFilename
 protected String getNewFilename()
 o writeToFile
 protected void writeToFile(byte theData[],
                            String filename) throws RemoteException, IOException
Writes a byte array to a file.

Parameters:
theData - the byte array of data
filename - the output filename
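
The two file helpers above (readFromStream(String) and writeToFile) move whole files in and out of byte arrays. The following is a minimal sketch of that round trip using java.nio.file, not the class's actual implementation. The class name FileBytes and the method name readFromFile are hypothetical.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Arrays;

// Hypothetical helper; not part of iceberg.transcoder.
final class FileBytes {

    // Writes the byte array to the named file, creating or truncating it.
    static void writeToFile(byte[] theData, String filename) throws IOException {
        Files.write(Paths.get(filename), theData);
    }

    // Reads the whole named file into a byte array.
    static byte[] readFromFile(String filename) throws IOException {
        return Files.readAllBytes(Paths.get(filename));
    }

    public static void main(String[] args) throws IOException {
        Path tmp = Files.createTempFile("transcoder-sketch", ".bin");
        byte[] data = {1, 2, 3, 4};
        writeToFile(data, tmp.toString());
        byte[] back = readFromFile(tmp.toString());
        System.out.println("round trip ok: " + Arrays.equals(data, back));
        Files.delete(tmp);
    }
}
```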
 o readFromStream
 protected byte[] readFromStream(InputStream is) throws RemoteException, IOException
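
The InputStream overload carries no description on this page. As a rough illustration only, draining a stream into a byte array might look like the sketch below. The class name StreamBytes and the 4096-byte buffer size are assumptions, and the real method may behave differently (for example, in whether it closes the stream).

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;

// Hypothetical helper; not part of iceberg.transcoder.
final class StreamBytes {

    // Reads the stream to end-of-file and returns everything that was read.
    // The caller remains responsible for closing the stream.
    static byte[] readFromStream(InputStream is) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buffer = new byte[4096]; // buffer size is an arbitrary choice
        int n;
        while ((n = is.read(buffer)) != -1) {
            out.write(buffer, 0, n);
        }
        return out.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        byte[] data = readFromStream(new ByteArrayInputStream(new byte[] {1, 2, 3}));
        System.out.println(data.length + " bytes read");
    }
}
```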
 o textOutline
 public Outline textOutline(Outline transObj,
                            byte theAudio[]) throws RemoteException, IOException
Method creates a text outline from audio.

Parameters:
transObj - a special MediaManager MIME type
theAudio - the byte array of speech (16 bit 22050Hz linear .wav). If you don't have the MediaManager, just comment this out.
Returns:
an iceberg.mediamanager.Outline object
 o speechRecognizer
 public byte[] speechRecognizer(byte theAudio[]) throws RemoteException, IOException
Method which transcodes speech to text

Parameters:
theAudio - the byte array of speech audio (.wav)
Returns:
a byte array of text
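
Because the recognized text comes back as a byte array, a caller has to decode it. The sketch below shows one way to do that against a stand-in for the service. The nested interface SpeechToText is a hypothetical substitute for TranscoderServiceIF (which is not reproduced on this page), and UTF-8 is an assumed encoding; this page does not say which charset the service uses.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.rmi.RemoteException;

// Hypothetical client-side sketch; not part of iceberg.transcoder.
final class RecognizerClientSketch {

    // Stand-in for the single method used here, with the signature documented above.
    interface SpeechToText {
        byte[] speechRecognizer(byte[] theAudio) throws RemoteException, IOException;
    }

    // Sends .wav bytes to the service and decodes the returned text.
    // UTF-8 is an assumption; substitute the charset the service really uses.
    static String recognizeToString(SpeechToText service, byte[] wavBytes) throws IOException {
        byte[] text = service.speechRecognizer(wavBytes);
        return new String(text, StandardCharsets.UTF_8).trim();
    }

    public static void main(String[] args) throws IOException {
        // Fake service that ignores the audio and returns fixed text.
        SpeechToText fake = audio -> "please call me back\n".getBytes(StandardCharsets.UTF_8);
        System.out.println(recognizeToString(fake, new byte[0]));
    }
}
```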
 o textSummary
 public byte[] textSummary(byte theAudio[]) throws RemoteException, IOException
Makes a text summary from voicemail. This consists of two parts: a) recognize name and phone number; b) recognize key words using pitch detection.

Parameters:
theAudio - byte array of speech audio (16 bit 22050Hz linear .wav)
Returns:
byte array of text summary
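
Several methods on this page expect 16 bit 22050Hz linear .wav input. A caller may want to check a byte array against that format before sending it. The sketch below walks the RIFF chunks of a WAV file and inspects the "fmt " chunk. The class WavFormatCheck and its methods are hypothetical and are not part of the service; the header layout is the standard RIFF/WAVE one.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.nio.charset.StandardCharsets;

// Hypothetical pre-flight check; not part of iceberg.transcoder.
final class WavFormatCheck {

    // True if the bytes look like a RIFF/WAVE file holding 16-bit linear PCM at 22050 Hz.
    static boolean isLinear16At22050(byte[] wav) {
        if (wav == null || wav.length < 12) return false;
        if (!tag(wav, 0).equals("RIFF") || !tag(wav, 8).equals("WAVE")) return false;
        ByteBuffer buf = ByteBuffer.wrap(wav).order(ByteOrder.LITTLE_ENDIAN);
        long pos = 12; // first chunk follows the 12-byte RIFF header
        while (pos + 8 <= wav.length) {
            int p = (int) pos;
            long size = buf.getInt(p + 4) & 0xFFFFFFFFL; // unsigned chunk size
            if (tag(wav, p).equals("fmt ")) {
                if (size < 16 || pos + 24 > wav.length) return false;
                int audioFormat = buf.getShort(p + 8) & 0xFFFF;    // 1 = linear PCM
                int sampleRate = buf.getInt(p + 12);
                int bitsPerSample = buf.getShort(p + 22) & 0xFFFF;
                return audioFormat == 1 && sampleRate == 22050 && bitsPerSample == 16;
            }
            pos += 8 + size + (size & 1); // chunks are padded to an even length
        }
        return false; // no "fmt " chunk found
    }

    // Reads a 4-character chunk tag at the given offset.
    private static String tag(byte[] b, int offset) {
        return new String(b, offset, 4, StandardCharsets.US_ASCII);
    }

    // Builds a 44-byte mono PCM header with an empty data chunk, for demonstration.
    static byte[] minimalWav(int sampleRate, int bitsPerSample) {
        ByteBuffer b = ByteBuffer.allocate(44).order(ByteOrder.LITTLE_ENDIAN);
        b.put("RIFF".getBytes(StandardCharsets.US_ASCII)).putInt(36);
        b.put("WAVE".getBytes(StandardCharsets.US_ASCII));
        b.put("fmt ".getBytes(StandardCharsets.US_ASCII)).putInt(16);
        b.putShort((short) 1).putShort((short) 1);                    // PCM, mono
        b.putInt(sampleRate).putInt(sampleRate * bitsPerSample / 8);  // rate, byte rate
        b.putShort((short) (bitsPerSample / 8)).putShort((short) bitsPerSample);
        b.put("data".getBytes(StandardCharsets.US_ASCII)).putInt(0);
        return b.array();
    }

    public static void main(String[] args) {
        System.out.println(isLinear16At22050(minimalWav(22050, 16)));
        System.out.println(isLinear16At22050(minimalWav(8000, 16)));
    }
}
```

The check only inspects the header; it does not validate the sample data or the channel count, which this page does not specify.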
 o audioSummary
 public byte[] audioSummary(byte theAudio[],
                            int format) throws RemoteException, IOException
Makes an audio summary from voicemail.

Parameters:
theAudio - byte array of speech audio (.wav)
format - output audio format
Returns:
byte array of summarized audio
 o skimmedAudio
 public byte[] skimmedAudio(byte theAudio[]) throws RemoteException, IOException
Method which generates skimmed audio from speech audio

Parameters:
theAudio - byte array of speech audio (16 bit 22050Hz linear .wav)
Returns:
byte array of summarized audio
 o audioAnalyzer
 protected String audioAnalyzer(String inputfile,
                                String operation,
                                String outdir) throws RemoteException
 o speechRecoCommand
 public String speechRecoCommand(String inputfile) throws RemoteException
Method which uses a rule grammar (NLP) for spotting key phrases in a speech audio file

Parameters:
inputfile - input file of speech audio
Returns:
output filename of the text file (.txt)
 o speechRecognizer
 public String speechRecognizer(String inputfile) throws RemoteException
Method which uses a dictation grammar for translating speech to text

Parameters:
inputfile - input file of speech audio
Returns:
output filename of the text file (.txt)
 o speechRecognizer
 public String speechRecognizer(String inputfile,
                                String outputfile) throws RemoteException
Method which uses a dictation grammar for translating speech to text

Parameters:
inputfile - input file of speech audio
outputfile - output file of text
Returns:
output filename of the text file (.txt)
 o toWave22
 protected String toWave22(String infilename) throws IOException
 o toGSM
 protected String toGSM(String infilename) throws IOException
 o toRaw22
 protected String toRaw22(String infilename) throws IOException
 o speechSynthesizer
 public byte[] speechSynthesizer(byte theText[],
                                 int format) throws RemoteException, IOException
Method which synthesizes speech from text

Parameters:
theText - byte array of text
format - format of audio output
Returns:
byte array of synthesized speech (16 bit 22050Hz .wav or GSM)
 o speechSynthesizer
 public String speechSynthesizer(String inputfile) throws RemoteException
Method which synthesizes speech from text

Parameters:
inputfile - input file of text
Returns:
filename of synthesized speech
 o speechSynthesizer
 public String speechSynthesizer(String inputfile,
                                 String outputfile) throws RemoteException
Method which synthesizes speech from text

Parameters:
inputfile - input filename of text
outputfile - output filename of audio
Returns:
filename of synthesized speech
 o main
 public static void main(String argv[])
Main program that recognizes the test.pcm audio file. To play the file: play -t .raw -r 22050 -w -s test.pcm
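
The play command treats test.pcm as headerless audio at 22050 samples per second. Assuming that -w and -s select 16-bit signed samples (as in older SoX releases) and that the file is mono, which this page does not state, the playing time follows directly from the file size. The class RawPcmDuration below is a hypothetical sketch of that arithmetic.

```java
// Hypothetical helper; not part of iceberg.transcoder.
final class RawPcmDuration {

    // Duration in seconds of headerless PCM data of the given size.
    static double seconds(long byteCount, int sampleRate, int bytesPerSample, int channels) {
        long bytesPerSecond = (long) sampleRate * bytesPerSample * channels;
        return (double) byteCount / bytesPerSecond;
    }

    public static void main(String[] args) {
        // 22050 Hz, 2 bytes per sample, 1 channel (mono is an assumption).
        System.out.println(seconds(44100, 22050, 2, 1));
    }
}
```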

