manojsingh101/streamsx.transcriptanalytics

streamsx.transcriptanalytics

Toolkit for real-time analytics of text2speech transcripts

  • Developing and running applications that use the TranscriptAnalytics Toolkit

Operators: DistanceModelScore - This operator uses a trained model (a collection of words) that represents a topic, or the words typically spoken on a channel. Each incoming tuple is scored against this model, and the resulting score indicates how close the incoming text is to the model. The score ranges from 0 to 1.

The following code shows how to invoke the operator in an SPL program:

stream<rstring ivrcallid, rstring channel, rstring utterance, float64 score> scoredUtterances = DistanceModelScore(csvLineData){
	param
		testDataAttr : utterance;
		modelPath : "/media/sf_vmshare/dev/streamsx.transcriptanalytics/machine.10.25.new.model.csv"; 
		minSetSize : 5;
}

On initialization, the operator loads the model. Each incoming tuple is then scored against the model, and the score is passed in the output schema as an attribute called ‘score’.

Other optional parameters for the operator

  1. minSetSize: a score is generated only if the text contains at least this many words; otherwise the score defaults to 0.
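
Because the score is an ordinary attribute on the output stream, downstream operators can act on it directly. The fragment below is a minimal sketch, not part of the toolkit: it uses the standard SPL Filter operator to keep only utterances that score at or above a threshold. The stream name onTopicUtterances and the cut-off 0.5 are assumptions chosen for illustration. It consumes the scoredUtterances stream from the invocation shown earlier.

```
// Hypothetical downstream step: keep utterances close to the model.
// The 0.5 threshold is an assumed value; tune it for your own model.
stream<rstring ivrcallid, rstring channel, rstring utterance, float64 score> onTopicUtterances = Filter(scoredUtterances){
	param
		filter : score >= 0.5;
}
```

Utterances with fewer words than minSetSize receive the default score of 0, so a filter like this one drops them as well.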

To learn more about Streams:
