Even without context, humans find a way to branch out a conversation sensibly. Sensibleness and Specificity Average (SSA) is a metric created by Google to measure how well a conversational chatbot responds in a sensible and specific way. More about the SSA metric is yet to be understood.

That's what being open-domain is about for a chatbot. There are many ways to evaluate a language model. Using Chomsky's method, a machine can come up with grammatically correct sentences without making much sense.

One of a new breed of open-domain chatbots designed to engage in conversations across any topic, Meena's free and natural conversational abilities are closing the gap on human performance.

Google AI's blog says, "To compute SSA, we crowd-sourced free-form conversation with the chatbots being tested: Meena and other well-known open-domain chatbots, notably, Mitsuku, Cleverbot, XiaoIce, and DialoGPT." Meena is described as a multi-turn open-domain chatbot trained end-to-end on data mined and filtered from public domain social media conversations.

Chills went down my spine when I watched Sundar Pichai introducing Google Duplex a couple of years ago.

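Google's Meena chatbot scores low on "perplexity," which is good: perplexity measures how uncertain a language model is when predicting the next word, so a lower score means it has less of a hard time finding the right word.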
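To make the idea of perplexity concrete, here is a minimal Python sketch. It assumes we already have the probability the model assigned to each word that actually came next; the function name `perplexity` and the toy probabilities are illustrative only and are not taken from Google's code or results.

```python
import math

def perplexity(token_probs):
    """Perplexity of a sequence, given the probability the model
    assigned to each token that actually occurred."""
    if not token_probs:
        raise ValueError("need at least one token probability")
    # Average negative log-likelihood per token (natural log).
    avg_nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    # Perplexity is the exponential of that average.
    return math.exp(avg_nll)

# Toy, made-up probabilities: a confident model vs. an unsure one.
confident = perplexity([0.5, 0.5, 0.5, 0.5])
unsure = perplexity([0.1, 0.1, 0.1, 0.1])
print(confident < unsure)
```

A model that gives each correct word a probability of 0.5 behaves roughly as if it were choosing between two equally likely words, while one that gives 0.1 behaves as if it were choosing between ten. That is why lower is better.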

"Mined and filtered" means that language processing and filtering were applied to the data.

To evaluate Meena's performance, researchers proposed a simple human evaluation metric called Sensibleness and Specificity Average (SSA), which considers two fundamental aspects of humanlike conversation: making sense and being specific, just as humans do.
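As a rough illustration of how such a score could be tallied, the sketch below assumes each chatbot response has been labeled by a human rater with two yes/no judgments, sensible and specific, and that a response only counts as specific if it is also sensible. The function name `ssa` and the toy labels are hypothetical, not Google's data or code.

```python
def ssa(labels):
    """labels: list of (sensible, specific) boolean pairs,
    one pair per chatbot response."""
    if not labels:
        raise ValueError("need at least one labeled response")
    n = len(labels)
    # Fraction of responses judged sensible.
    sensibleness = sum(1 for sensible, _ in labels if sensible) / n
    # Assumption: a response is specific only if it is also sensible.
    specificity = sum(
        1 for sensible, specific in labels if sensible and specific
    ) / n
    # SSA is the plain average of the two fractions.
    return sensibleness, specificity, (sensibleness + specificity) / 2

# Toy, made-up labels for four responses.
toy_labels = [(True, True), (True, False), (False, False), (True, True)]
sens, spec, score = ssa(toy_labels)
print(sens, spec, score)
```

With these toy labels, three of four responses are sensible and two of four are specific, so the average lands between the two fractions. A vague reply such as "I don't know" can be sensible without being specific, which is why both judgments are tracked.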

"Current open-domain chatbots have a critical flaw: they often don't make sense." The scientists behind Meena built the chatbot to be responsive to people's messages, to stay on topic, and to behave as much like another human being as possible.

To reach those goals, Meena was built as an open-domain chatbot. That's of utmost importance.

Remember the conversation in which Lisa (Google Duplex) tries to book a haircut with a real person? The "umm-hmm" was magic.

In a surprising finding, the researchers observed a strong correlation between SSA and perplexity — an automatic metric available to any neural seq2seq model.

This week, Google introduced Meena, a chatbot that can "chat about… anything." Meena is the latest of many efforts by large tech companies trying to solve one of the toughest challenges of artificial intelligence: language.

The idea is to make a chatbot more like humans.
