Viewing a single comment thread. View all comments

EthansWay007 t1_j1w05nk wrote

I’m curious, how do they use the data of it being asking questions to improve it? Does it flag questions it couldn’t answer and then the team updates it?

1

Nextil t1_j1zqxp9 wrote

You can rate the responses up or down and provide an "ideal" response.

2

gelukuMLG t1_j23znll wrote

I think it saves the highly rated responses and feeds it into a dataset then it uses reinforcement learning by giving a positive reward to them.

1