• 0 Posts
  • 228 Comments
Joined 11 months ago
Cake day: August 12th, 2023




  • Turun@feddit.de to Science Memes@mander.xyz: Elsevier
    English · 2 upvotes · 12 days ago

    I don’t understand the “that’s not how PDFs work” criticism.

    Removing data from the original file is the whole point of the exercise! Of course unique tokens can be hidden in plain sight in images, letter spacing, etc. If we want to make sure those are removed, we need to degrade the quality of the PDF so that this information is lost in the lossy conversion.








  • Turun@feddit.de to Science Memes@mander.xyz: accents
    English · 20 upvotes · edited · 1 month ago

    Yes, because they communicate with natural behavior. (I’m blanking on the word here; it’s behavior they were simply born with.)

    Language is learned, but you can still “understand” another human if they are angry, crying, or laughing, no matter where they are from.







    1. Yes, but devil’s advocate: you also need a program to read text files, so needing a program to read SQLite files is not worse.

    2. I am confused by your requirements. Why do you need to store your data as JSON or XML? Would it suit your requirements to read in text files, convert to SQLite for processing, and then save as a text file? What do you gain by being able to edit the files in a text editor, as opposed to a table editor? Do you maybe just need a config file (e.g. in TOML format) and don’t actually do much data processing?
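
For what it's worth, the "read text, process in SQLite, save as text" round trip from point 2 could look roughly like the sketch below. It is only an illustration: the record layout (`name`, `amount`), the table name `items`, and the function name `process_records` are all made up here, and it assumes the data fits in memory and arrives as a JSON list of flat objects.

```python
import json
import sqlite3


def process_records(json_text: str) -> str:
    """Load JSON records into an in-memory SQLite table, aggregate, return JSON text."""
    # Hypothetical layout: a list of {"name": ..., "amount": ...} objects.
    records = json.loads(json_text)

    conn = sqlite3.connect(":memory:")  # nothing is written to disk
    try:
        conn.execute("CREATE TABLE items (name TEXT, amount REAL)")
        # Named placeholders pull the values out of each dict.
        conn.executemany(
            "INSERT INTO items (name, amount) VALUES (:name, :amount)", records
        )
        # The actual "processing" step: any SQL query would go here.
        rows = conn.execute(
            "SELECT name, SUM(amount) FROM items GROUP BY name ORDER BY name"
        ).fetchall()
    finally:
        conn.close()

    # Back to a text format that stays editable in a plain text editor.
    return json.dumps([{"name": n, "total": t} for n, t in rows], indent=2)


if __name__ == "__main__":
    sample = '[{"name": "b", "amount": 2}, {"name": "a", "amount": 1.5}, {"name": "a", "amount": 2.5}]'
    print(process_records(sample))
```

The files on disk stay plain text, and SQLite only exists for the duration of the processing step, so you keep the text-editor workflow and still get SQL for the heavy lifting.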