Database dump survey

Joey
Dawnsong - Derpi Supporter
Lunar Supporter - Helped forge New Lunar Republic's freedom in the face of the Solar Empire's oppressive tyrannical regime (April Fools 2023).
Flower Trio - Helped others get their OC into the 2023 Derpibooru Collab.
Roseluck - Had their OC in the 2023 Derpibooru Collab.
A Lovely Nightmare Night - Celebrated the 12th anniversary of MLP:FIM!
Cool Crow - "Caw!" An awesome tagger
Tree of Harmony - Drew someone's OC for the 2022 Community Collab
Elements of Harmony - Had an OC in the 2022 Community Collab
Non-Fungible Trixie -
Verified Pegasus - Show us your gorgeous wings!

PM me your cute OCs
@Luna  
Upvote/Downvote counts are already available on the image metadata. I’m not really a huge fan of publishing hide counts.
 
Anonymizing them by assigning each user a unique ID isn’t good - at best someone could correlate data to create anonymous profiles on people, at worse they could use those correlations to actually start identifying people.
xbi
Fine Arts - Two hundred uploads with a score of over a hundred (Safe/Suggestive)
Da Magicks! - Merited Fine Arts badge with only their own art
The Power of Love - Given to a publicly verified artist with an image under their artist’s tag that has reached 1000 upvotes
A Really Hyper Artist - 500+ images under their artist tag
Best Artist - Providing quality, Derpibooru-exclusive artwork
A Really Classy Artist - 250+ images under their artist tag
An Artist Who Rocks - 100+ images under their artist tag
Artist -

Wow, thank you!
 
I will be happy to see the following:
 
  1. table information about every image the same as JSON api provides , without duplication of many URLS which can be derived with templates (is direct link to image possible to get without date?). Tag names are optional, they can be derived from additional table.
     
    I don’t need hashes (they are long the are won’t be compressed in the dump) and duplicated urls and tags duplicated by tag_ids
     
    1b) Also list of each image favorites, it is really useful to find correlation of interest of users. I need only shortest possible nameless IDs of favorites, not names, to keep database small and little less worry about flashing someone’s names in debug information. This is many-to-many relation, I will happy with any representation ( user2pics list, pic2users text list, or what is best for database system).
     
     
  2. small table with templates to turn image ID to URLS to thumb and direct link to image. Actialy this table can be put just in readme.txt file
     
  3. tag-id to tag-name table and basic tag information (category, count).
     
     
  4. incremental updates of the database, so if I want to update month old database I can download only daily differences. But this feature can be added much-much later.
     
  5. I think indices are not needed, could be created locally after the download. Tag-2-pics, sorting based on score and fave count index would be useful, but I don’t care if is already built or I build it locally (unless it took many hours to create).
     
    Don’t need comment text, maybe if you put comments to separate database dump this will allow to keep dump much smaller.
root64
Lunar Supporter - Helped forge New Lunar Republic's freedom in the face of the Solar Empire's oppressive tyrannical regime (April Fools 2023).
My Little Pony - 1992 Edition

Redheart is my waifu.
Sounds interesting. I’d use it just to find out things like which ponies get the most fan art.
JP
Pixel Perfection - I still call her Lightning Bolt
Silly Pony - Celebrated the 13th anniversary of MLP:FIM, and 40 years of MLP!
Shimmering Smile - Celebrated the 10th anniversary of Equestria Girls!
Solar Guardian - Refused to surrender in the face of the Lunar rebellion and showed utmost loyalty to the Solar Empire (April Fools 2023).
Roseluck - Had their OC in the 2023 Derpibooru Collab.
King Sombra - Celebrated the 10th anniversary of The Crystal Empire!
A Lovely Nightmare Night - Celebrated the 12th anniversary of MLP:FIM!
Princess of Love - Extra special version for those who participated in the Canterlot Wedding 10th anniversary event by contributing art.
Elements of Harmony - Had an OC in the 2022 Community Collab
Non-Fungible Trixie -

I miss the show so much
For the data dump itself, I guess everything that’s publicly available should be in there. But perhaps the images could be in a separate dump? They take up way more space than metadata. The primary dump could include only thumbnails for quick perusing.
 
How big would the initial dump be? Will there be deltas available, so that one does not have to download everything again every day, but just few hundred megabytes of changes (or more if the images are included)? I could use these numbers, because I need to assess how big hard drives I’m going to buy.
Background Pony #A340
@JP  
As I understood it’s only a DB dump, it won’t include images. For sharing such amount of images, something like IPFS would be a good idea.
DBot
My Little Pony - 1992 Edition
Wallet After Summer Sale -

gui.derpi.user.title
I don’t see any reasons why these dumps should be provided unless you are already facing some (probably minor) problems with site which can lead to it’s shutdown (i wish it wouldn’t shutdown).
 
The only useful thing about this dump is that other developers can create an alternative web interface for Derpibooru. And considering lack of most modern features (infinite scrolling, ajax loading, interactive javascript based interface), I’m pretty sure someone would find those dumps useful.  
Just make sure those dumps contain only data faced on your endpoint public API.
Mr Zero
Solar Supporter - Fought against the New Lunar Republic rebellion on the side of the Solar Deity (April Fools 2023).
Queen of Love - For those who participated in the Canterlot Wedding 10th anniversary event
Non-Fungible Trixie -
Verified Pegasus - Show us your gorgeous wings!
Preenhub - We all know what you were up to this evening~
Ten years of changes - Celebrated the 10th anniversary of MLP:FiM!
My Little Pony - 1992 Edition
Birthday Cake - Celebrated MLP's 7th birthday
Not a Llama - Happy April Fools Day!
The End wasn't The End - Found a new home after the great exodus of 2012

There seems to be some dictionary disagreement as to what dumping means. Do you mean copying, back-up-ing, deletion, or some other definition? Will this be for the “public-facing files” only, meaning that any hiccup leading to restoring from backups might mean the vanishing of material of a bootytastic nature?  
1- I guess that’s ok?
DBot
My Little Pony - 1992 Edition
Wallet After Summer Sale -

gui.derpi.user.title
Dump in Database Computing is a file which hold data in it’s “raw” form (it is common that even human can view it as regular text file, and this file can be recognised by wide range of versions of same RDBMS), and you can restore table/tables/entire databases from dumps by importing them
byte[]
Solar Supporter - Fought against the New Lunar Republic rebellion on the side of the Solar Deity (April Fools 2023).
Non-Fungible Trixie -
Verified Pegasus - Show us your gorgeous wings!
Preenhub - We all know what you were up to this evening~
An Artist Who Rocks - 100+ images under their artist tag
Artist -

Philomena Contributor
@JP  
With the current amount of data that have been requested, it would likely be 1.5-2 GB for each one, though it might be smaller if LZMA compressed afterwards.
 
I’m not going to generate deltas because it doesn’t make sense in the format I’m offering dumps in. We’ll be offering the full data file each time.
 
It will not include image files. Those aren’t in our database.
 
@DBot  
We’re not planning any downtime events or a site shutdown. This is (1) us fulfilling our role as an archive site and (2) providing a separate path for developers to extract data from the site without having to page through the entire site.
 
@Mr Zero  
Dumps are used in the context of the pg_dump utility. PostgreSQL considers them different from backups, as they do not reflect the internal representation of the database files, and (in general) only contain heap data and the schema statements needed to recreate the database files. In contrast, a backup would also include the on-disk structure of indices, temporary tables, and the WAL as it existed at the time.
silbasa
Pixel Perfection - Hot Pockets Spotted
Lunar Supporter - Helped forge New Lunar Republic's freedom in the face of the Solar Empire's oppressive tyrannical regime (April Fools 2023).
Non-Fungible Trixie -
Preenhub - We all know what you were up to this evening~
My Little Pony - 1992 Edition
Wallet After Summer Sale -

So its all anonymous and does not identify users?
 
What developers want these data and what do they do with them?
 
(If its for securing DB sure, otherwise its hard to understand why this is a thingie…)
Anonshy
Preenhub - We all know what you were up to this evening~
Philomena - For helping others attend the 2021 community collab
Twinkling Balloon - Took part in the 2021 community collab.
My Little Pony - 1992 Edition
Friendship, Art, and Magic (2020) - Took part in the 2020 Community Collab
Dream Come True! - Participated in the MLP 9th Anniversary Event
Toola Roola - For helping others attend the 2019 Community Collab
Wallet After Summer Sale -
Friendship, Art, and Magic (2019) - Celebrated Derpibooru's seventh year anniversary with friends.
A Tale For The Ages - Celebrated MLP's 35th Anniversary and FiM's 8th Anniversary

Busy - May not Respond
I’m not a developer, but I certainly would be interested on seeing that just out of curiosity; I might even learn something from it
silbasa
Pixel Perfection - Hot Pockets Spotted
Lunar Supporter - Helped forge New Lunar Republic's freedom in the face of the Solar Empire's oppressive tyrannical regime (April Fools 2023).
Non-Fungible Trixie -
Preenhub - We all know what you were up to this evening~
My Little Pony - 1992 Edition
Wallet After Summer Sale -

@byte[]
 
Thank you for your time. My understanding of this subject is to limited to have an opinion as I do not fully understands what it entails. But I trust you devs running the site. As long as it is safe and sane I suppose…
Interested in advertising on Derpibooru? Click here for information!
Pony Arts & Prints!

Help fund the $15 daily operational cost of Derpibooru - support us financially!

Syntax quick reference: **bold** *italic* ||hide text|| `code` __underline__ ~~strike~~ ^sup^ %sub%

Detailed syntax guide