This account is being kept for the posterity, but it won’t see further activity past February.

If you want to contact me, I’m at /u/[email protected]

  • 20 Posts
  • 368 Comments
Joined 3 years ago
cake
Cake day: April 9th, 2021

help-circle
  • The settlement is right at the border of what would be controlled by the Inca government, two millenniums later. It shows that there’s some decent access to the region from the west than you’d be led to believe, with the Andes in the way.

    As such, if they find other cities further east, I’m predicting that, culturally speaking, they’ll resemble nothing this one; even if they happen to be roughly the same size.

    People ate maize and sweet potato, and probably drank “chicha”, a type of sweet beer.

    “If you don’t have chicha, any small thing will do.” (reference to a certain song)

    Serious now. Potentially yucca too - it grows right next door, and if they got maize from North America then they likely traded for crops.











  • The source that I’ve linked mentions semantic embedding; so does further literature on the internet. However, the operations are still being performed with the vectors resulting from the tokens themselves, with said embedding playing a secondary role.

    This is evident for example through excerpts like

    The token embeddings map a token ID to a fixed-size vector with some semantic meaning of the tokens. These brings some interesting properties: similar tokens will have a similar embedding (in other words, calculating the cosine similarity between two embeddings will give us a good idea of how similar the tokens are).

    Emphasis mine. A similar conclusion (that the LLM is still handling the tokens, not their meaning) can be reached by analysing the hallucinations that your typical LLM bot outputs, and asking why that hallu is there.

    What I’m proposing is deeper than that. It’s to use the input tokens (i.e. morphemes) only to retrieve the sememes (units of meaning; further info here) that they’re conveying, then discard the tokens themselves, and perform the operations solely on the sememes. Then for the output you translate the sememes obtained by the transformer into morphemes=tokens again.

    I believe that this would have two big benefits:

    1. The amount of data necessary to “train” the LLM will decrease. Perhaps by orders of magnitude.
    2. A major type of hallucination will go away: self-contradiction (for example: states that A exists, then that A doesn’t exist).

    And it might be an additional layer, but the whole approach is considerably simpler than what’s being done currently - pretending that the tokens themselves have some intrinsic value, then playing whack-a-mole with situations where the token and the contextually assigned value (by the human using the LLM) differ.

    [This could even go deeper, handling a pragmatic layer beyond the tokens/morphemes and the units of meaning/sememes. It would be closer to what @[email protected] understood from my other comment, as it would then deal with the intent of the utterance.]


  • Not quite. I’m focusing on chatbots like Bard, ChatGPT and the likes, and their technology (LLM, or large language model).

    At the core those LLMs work like this: they pick words, split them into “tokens”, and then perform a few operations on those tokens, across multiple layers. But at the end of the day they still work with the words themselves, not with the meaning being encoded by those words.

    What I want is an LLM that assigns multiple meanings for those words, and performs the operations above on the meaning itself. In other words the LLM would actually understand you, not just chain words.


  • Complexity does not mean sophistication when it comes to AI and never has and to treat it as such is just a forceful way to make your ideas come true without putting in the real effort.

    It’s a bit off-topic, but what I really want is a language model that assigns semantic values to the tokens, and handles those values instead of directly working with the tokens themselves. That would be probably far less complex than current state-of-art LLMs, but way more sophisticated, and require far less data for “training”.






  • If you want, you could use GMail filters to delete those emails automatically. Here’s how:

    1. click the engine button (settings), then “see all settings”, then “filters and blocked addresses”.
    2. click “create a new filter”. Add “top of Google search” to the field “has the words”, leave other fields blank.
    3. click “create filter”, then check the “delete it” box, then “create filter” again.
    4. repeat steps 2-3 for other shit that SEO spam is likely to mention.

    Important: never use as a filter anything that legitimate users might reasonably say. Only things that you’re fairly certain to come from a spammer.

    EDIT: I repeated two steps without noticing it. My bad.