Jan Leike: We experienced a substantial team of folks read through ChatGPT prompts and responses, then say if 1 response was preferable to a different reaction. All this information then got merged into a single teaching run. Considerably of it is the same form of thing as what we did https://andresqwbls.blogrenanda.com/37742765/an-unbiased-view-of-chatgpt