Thank you for sharing this! I am currently studying NLP.
Along the way, I've been struggling with a question and I hope someone can help me understand how to go about it: how would you build a model that does more than one NLP task? For a simple classifier like input: text (a tweet) and output: text (an emotion), you can fine-tune an existing classifier on such a dataset. But how would you build a model that does NER and sentiment analysis? E.g. input: text (a Yelp review of a restaurant) and output: a list of (entity, sentiment) tuples (e.g. [("tacos", "good"), ("margaritas", "good"), ("salsa", "bad")]). If you have a dataset structured this way and want to fine-tune a model, how does that model know how to make use of a Python list of tuples?
If you have the dataset, you could try fine-tuning a model like T5 [1] (see the notebook in [2]).
You just need to create [(input, output)] examples in the format you want.
For example:
[("a Yelp review of a restaurant", [("tacos", "good"), ("margaritas", "good"), ("salsa", "bad")])]
With enough data, the model should be able to learn to generate the output in the right format.
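Very roughly, the fine-tuning step could look like this (an untested sketch, not from [1] or [2]; the model name, learning rate, and tiny toy dataset are just placeholders):

    # Minimal sketch of fine-tuning T5 on (review, serialized-tuple-list) pairs
    # with Hugging Face transformers. Hyperparameters and data are assumptions.
    import torch
    from transformers import T5TokenizerFast, T5ForConditionalGeneration

    examples = [
        ("The tacos and margaritas were great but the salsa was bland.",
         '[("tacos", "good"), ("margaritas", "good"), ("salsa", "bad")]'),
    ]

    tokenizer = T5TokenizerFast.from_pretrained("t5-small")
    model = T5ForConditionalGeneration.from_pretrained("t5-small")
    optimizer = torch.optim.AdamW(model.parameters(), lr=3e-4)

    model.train()
    for epoch in range(3):
        for review, target in examples:
            inputs = tokenizer(review, return_tensors="pt", truncation=True)
            labels = tokenizer(target, return_tensors="pt", truncation=True).input_ids
            loss = model(**inputs, labels=labels).loss  # standard seq2seq loss
            loss.backward()
            optimizer.step()
            optimizer.zero_grad()

The only "multi-task" part is that the target string happens to encode both the entities and their sentiments; the model just learns to reproduce that format.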
> Python list of tuples
Things get interesting if you want to generate actual Python code. You can use a large language model with just a few examples of the task to generate such code. For example, see https://reasonwithpal.com/.
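In practice the few-shot version can be as simple as a prompt with a couple of worked (review -> list-of-tuples) examples, handed to whatever large language model you have access to. The examples and wording below are made up for illustration; how you call the model and parse the completion is up to you:

    # Hypothetical few-shot prompt; send `prompt` to an LLM of your choice.
    FEW_SHOT_PROMPT = """\
    Extract (entity, sentiment) pairs from the review as a Python list of tuples.

    Review: "Great burgers, terrible fries."
    Pairs: [("burgers", "good"), ("fries", "bad")]

    Review: "The tacos and margaritas were great but the salsa was bland."
    Pairs: [("tacos", "good"), ("margaritas", "good"), ("salsa", "bad")]

    Review: "{review}"
    Pairs:"""

    prompt = FEW_SHOT_PROMPT.format(review="Loved the carnitas, service was slow.")
    # parse the model's completion, e.g. with ast.literal_eval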
You could start by looking into either multitask transformers or really general seq2seq models like T5. With T5, for example, it just learns to transform one text sequence into another. So you could fine-tune T5 to produce your target sequence, but rather than outputting an actual Python list of tuples, it would output a string that looks like a sequence of tuples, which you would then parse back into Python objects yourself.
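Something like this for the round trip (a sketch, assuming you already have a fine-tuned checkpoint; "my-finetuned-t5" is a placeholder, and ast.literal_eval plus a try/except is just one way to guard against malformed generations):

    # The model emits a *string* shaped like a list of tuples; parsing it back
    # into Python objects is your job, not the model's.
    import ast
    from transformers import T5TokenizerFast, T5ForConditionalGeneration

    tokenizer = T5TokenizerFast.from_pretrained("my-finetuned-t5")  # placeholder
    model = T5ForConditionalGeneration.from_pretrained("my-finetuned-t5")

    inputs = tokenizer("Loved the tacos, hated the salsa.", return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=64)
    raw = tokenizer.decode(output_ids[0], skip_special_tokens=True)

    try:
        pairs = ast.literal_eval(raw)   # e.g. [("tacos", "good"), ("salsa", "bad")]
    except (ValueError, SyntaxError):
        pairs = []                      # the model can emit something unparseable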
Ah, so if the model is just converting input text into output text, it can really learn how to do just about anything? But there may be certain aspects of model design that make it better at some types of conversions ("tasks") than others? And there may be certain datasets you'd want to train a base model on first, to get general language comprehension, and then build on top of that for your specific use case?
Yeah, I can see that being the case for specialized domains. With state-of-the-art models widely available to the public, your edge will probably come from knowledge of the domain and its workflows, and from fine-tuning models to suit that domain.
Yours is an example of aspect-based sentiment analysis. Typically it has been tackled in two steps: first extract the aspects, then classify them as positive/negative. GPT or T5 are possible options for doing both in one go, but splitting the task still seems to be a good option [1].
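A hand-wavy version of the two-step pipeline, just to make the shape concrete (this is not what [1] does; spaCy noun chunks are a crude stand-in for a trained aspect extractor, and the off-the-shelf sentiment pipeline is an assumption):

    # Step 1: pull candidate aspects out of the review.
    # Step 2: score each aspect by pairing it with its sentence.
    import spacy
    from transformers import pipeline

    nlp = spacy.load("en_core_web_sm")
    sentiment = pipeline("sentiment-analysis")

    review = "The tacos and margaritas were great but the salsa was bland."
    doc = nlp(review)

    results = []
    for chunk in doc.noun_chunks:                                  # step 1
        label = sentiment(f"{chunk.text}: {chunk.sent.text}")[0]["label"]  # step 2
        results.append((chunk.text, label.lower()))

    print(results)  # e.g. [("the tacos", "positive"), ("the salsa", "negative"), ...]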