Fetcharr - a human-developed Huntarr replacement

Fetcharr - a human-developed Huntarr replacement
from egg82@lemmy.world to selfhosted@lemmy.world on 08 Mar 19:01
https://lemmy.world/post/44006156

github.com/egg82/fetcharr

Disclaimer: I am the developer

Long story short, after Huntarr exploded I still wanted an app that did the core of Huntarr’s job: find and fetch missing or upgradable media. I looked around for some solutions but didn’t like them for various reasons. So, I made my own.

No web UI, configured via environment variables in a similar manner to Unpackerr. It does one job and it does it (a little too) well. Even when trying a few different solutions for a few days each, Fetcharr caught a bunch of stuff they all missed almost immediately. This is likely due to the way it weights media for search.

Since you made it this far, a few notes:

I did still use ChatGPT on a couple of occasions. They’re documented and entirely web UI - no agents. Anything it gave me was vetted and noted in the code before publishing.
The current icon is temporary and LLM-generated. I’ve put out some feelers to pay an artist to create an icon. Waiting to hear back.
It’s written in Java because that’s the language I’m most familiar with. SSL certs in Java containers can be painful but I added some code to make it as easy as Python requests or Node
While it still has a skip-if-tagged-with-X feature, it doesn’t create or apply any tags. I didn’t find that portion necessary, despite other popular *arrs using it. Not sure why they do, even after developing this.
Caution is advised when first using it on a large media collection. It’ll very likely pick up quite a number of things initially if you weren’t on top of things beforehand. Just make sure your pipeline is set up well, or you limit the number of searches or lengthen the amount of time between searches using the environment variables.

#selfhosted

threaded - newest

irmadlad@lemmy.world on 08 Mar 19:11 next collapse

human-developed

Love the distinction. LOL

egg82@lemmy.world on 08 Mar 19:19 next collapse

it’s today’s trend! One I happen to agree with, which is nice.

I’m trying to limit LLM exposure on this one to “as little as possible, within reason”. It’s still a tool and can be used effectively in some areas.

irmadlad@lemmy.world on 08 Mar 19:29 collapse

My only real conundrum with AI coding, is totally relying on AI as the dev, then releasing it for public use without really knowing what happens behind the scenes and obviously the security of said app. Now if the dev is using AI as an assistant, and the dev is knowledgeable enough to know that things are operating securely, I’m ok with it.

hzl@piefed.blahaj.zone on 08 Mar 22:56 next collapse

Yeah, there’s a version of using AI to help with coding that’s more along the lines of cobbling together pieces from tutorials to figure out how to do something and making it fit your needs rather than just straight up asking for code and blindly adding it. It’s obviously not going to be as good as code from someone experienced who’s managed to internalize the relevant documentaion, but it’s at least informed by a human who understands what it’s doing

reabsorbthelight@lemmy.world on 09 Mar 13:17 collapse

Also people who think they can just vibe code without ever learning how to code for real. I’ll “vibe code”, but I’m also 10+ years experienced. I can quickly detect bullshit from the AI and I check pretty thoroughly.

Some dentist turned vibe coder will make absolute trash

irmadlad@lemmy.world on 09 Mar 13:47 collapse

We have IDEs and all kinds of tools to help us code. AI is just another tool. Granted, it’s a tool that needs some heavy regulation, but a tool nonetheless.

ppb1701@ppb.social on 09 Mar 14:26 collapse

@irmadlad @reabsorbthelight in the case of coding also needs supervision. it would totally push to prod on friday closing time lol. But yes it can be a useful tool for certain things....n ot everything the AI companies try to tell us.

irmadlad@lemmy.world on 09 Mar 15:41 collapse

n ot everything the AI companies try to tell us

Of course not. They are sales. They are trying to maximize the profit potential to their investors. I do believe that if we could get some oversight and regulation, as much as I chafe against regulation…it’s necessary, and get past this novelty stage of AI Rice Cookers, I think AI does have a lot of potential.

ppb1701@ppb.social on 09 Mar 15:46 collapse

@irmadlad yep. right now it's like the wild west and in large part they come off a snake oil salesman. But there is some truth there since for some tasks it can be helpful.

lazynooblet@lazysoci.al on 08 Mar 22:25 next collapse

I expect as there is a shift to vibe coding, saying “human coded” is going to be similar to “free from artificial colours and flavours”.

lmr0x61@lemmy.ml on 08 Mar 22:58 next collapse

100% ethically-sourced, organic code

irmadlad@lemmy.world on 08 Mar 23:50 collapse

I like that.

irmadlad@lemmy.world on 08 Mar 23:12 collapse

“free from artificial colours and flavours”

LOL

aeiou@piefed.social on 08 Mar 23:45 collapse

For some reason the hubub around non-AI software reminds me of produce.

‘Guaranteed 100% locally open-sourced, free-range, ~~GMO~~AI-free code!’

irmadlad@lemmy.world on 08 Mar 23:49 next collapse

Maybe we should have some rating system like Rated PG, or R, etc but for opensource software:

100% AI
AI Assisted
Human Coded

bufalo1973@piefed.social on 09 Mar 01:29 collapse

100% AI
Human supervised
AI Assisted
Human Coded

It’s better is more fine grained.

pokexpert30@jlai.lu on 09 Mar 13:56 collapse

Stealing this for my projects that are 100% human supervised. I used “vibe coded” so far but I felt I still brang a lot.

trollblox_@lemmy.world on 09 Mar 00:18 collapse

except GMOs aren’t actively harmful while AI is

JoeBigelow@lemmy.ca on 09 Mar 13:13 collapse

GMO’s create a societal harm, not a physiological one. Patenting a cultivar of a plant is a slippery slope, and we have already had quite a slide. In theory, if somebody works very hard to breed a new cultivar, they deserve to reap the rewards of that work and protect their creation. Okay, that makes sense. But if I have all the ingredients to breed that same cultivar, and do all the same work myself, should they be able to restrict my ability to profit? A step farther, and this is the reality we inhabit. Monsanto has created an environment where their patented corn seed is the best bet for a profitable harvest, but farmers are required to sign highly binding contracts with ridiculous stipulations. Beyond that, if a neighbor farmer dares to plant non patented seed, and the wind blows his neighbors Monsanto pollen (corn is wind pollinated) into his field and it pollinates his crop, he is now in patent violation of a company he has no business with that is now going to aggressively come after him in court. This is actually happening in the American Midwest.

aeiou@piefed.social on 09 Mar 20:27 collapse

neighbor farmer dares to plant non patented seed, and the wind blows his neighbors Monsanto pollen (corn is wind pollinated) into his field and it pollinates his crop, he is now in patent violation of a company he has no business with that is now going to aggressively come after him in court.

so it is like AI - it spreads everywhere and creates a lot of legal problems

neidu3@sh.itjust.works on 08 Mar 19:34 next collapse

In this day and age, shouldn’t Huntarr be replaced by Gatherarr? You know, sustainability and all…

egg82@lemmy.world on 08 Mar 19:37 next collapse

Wouldn’t it be called Foragarr then?

Scrath@lemmy.dbzer0.com on 08 Mar 20:25 collapse

Why not skip ahead in time a little and call it farmarr?

EmoPolarbear@lemmy.ca on 08 Mar 21:58 collapse

Agricultarr?

Twinklebreeze@lemmy.world on 09 Mar 12:39 collapse

I don’t think gathering is more sustainable than hunting at the numbers we’re at now. Humans could easily strip everything bare

hesh@quokk.au on 08 Mar 19:27 next collapse

Since Sonarr et al already find/upgrade missing media, how does this fit with them? Is it finding stuff they miss? Or does this replace them?

egg82@lemmy.world on 08 Mar 19:40 collapse

That’s an interesting point. In my years of running them all I’ve always needed a third-party something to upgrade or find missing media. I don’t exactly know why the built-in systems don’t work, but they genuinely do not seem to. I’ll occasionally see a scan go off but, for some reason, nothing ever gets picked up.

So, yeah; long story short, the built-ins don’t work and I don’t know why and this was still easier than trying to figure it out.

Edit: if you’re curious, give Fetcharr a try and let me know if it does anything for you. It’s free and takes a couple minutes. It should be pretty immediate, if your experience ends up being anything like mine.

smiletolerantly@awful.systems on 08 Mar 19:43 next collapse

Not to dimish your work at all, but: the Sonarr upgrades absolutely do work.

egg82@lemmy.world on 08 Mar 19:48 next collapse

honestly if they work for you then awesome! Maybe mine is misconfigured somehow or maybe I just have bad luck, but Radarr, Sonarr, Lidarr, etc have never caught everything. Once I started playing with this I realized just how much I was missing.

Either way, if your current system works for you then I don’t usually recommend changing it. Give it a try if you want- the worst it can do it accidentally find something that could be upgraded or missing. Or if you’d rather leave your stack alone that’s perfectly fine as well.

exu@feditown.com on 08 Mar 20:29 collapse

Sonarr and Radarr heavily rely on quality profiles you need to define, for examples see TrashGuides.

Your system probably needs less setup in comparison

egg82@lemmy.world on 08 Mar 20:41 collapse

ah, yeah, that would make sense as to why these types of systems are so popular. Since I’m a devops type by trade, my arr stack lives in a couple of kubernetes clusters. I use a Configarr cronjob with a fairly customized configmap to sync the trash guides with some minor preference edits. Maybe my issue is that it’s too defined, but I think if that were the case I wouldn’t be getting any benefit out of Fetcharr. Honestly even if it weren’t the case you’d think I’d at least be picking up movies that are completely missing. I’m not sure what to blame, here, but if other people are verifying that the builtin systems work for them (as well as something like Fatcharr does) then I assume it’s a skill issue or bad luck on my part.

ada@lemmy.blahaj.zone on 08 Mar 21:00 collapse

They do, but only by passively monitoring RSS feeds for new content that exceeds your current quality. They don’t do active upgrade searches unless you manually trigger them.

The distinction is important if you imported some or all of your media library, rather than building it from scratch with the arr stack stuff. It also matters if you source some your content via providers that don’t have RSS feeds.

egg82@lemmy.world on 08 Mar 21:06 collapse

I think you may have nailed what’s happening to my stack. I remember looking into it a couple years ago and RSS was stuck in my head but I wasn’t sure why. This tracks, and explains why active fetching works significantly better for me.

hesh@quokk.au on 08 Mar 20:28 collapse

Just to add, I didnt mean to put down this software at all – I’m always a fan of more self hosting. I just remember reading people using Huntarr alongside a full *arr stack and was curious how it fits.

egg82@lemmy.world on 08 Mar 20:49 collapse

absolutely! As with everything, try it out and see if it fits. Personally, I prefer apps that do their job well, and as few of them running as possible. If you don’t think it’ll be useful or try it out and find that it’s not, then that’s for the best. It means you’re good to go without any extra hangers-on. I tried the app as I was developing it and not only found it useful to myself, but it worked so well for me that I thought it might be useful to other people as well.

ttyybb@lemmy.world on 08 Mar 19:37 next collapse

I’ve never heard of Huntarr. What is this?

egg82@lemmy.world on 08 Mar 19:45 collapse

that’s a decent point. Not everyone knows about the Huntarr saga (Reddit link but that’s where the story broke) and what it did.

The idea is that you’ll occasionally want to go through all your media and make sure it’s the best quality available and that nothing’s missing. New releases get published, remuxes sometimes fix issues, etc. This little CLI container goes through and periodically searches everything you connect it to, so you don’t have to sacrifice hours of your weekend doing manual hunting.

Edit: as a couple have pointed out this is supposed to happen automatically with built-in searches. In my experience this isn’t the case but ymmv and if what you’ve got works for you then that’s great!

Luminous5481@anarchist.nexus on 08 Mar 20:06 next collapse

find and fetch missing or upgradable media

Sounds like a solution in search of a problem, considering the other Servarr apps already do that.

egg82@lemmy.world on 08 Mar 20:12 collapse

yep! If your arr stack already does what you want then I don’t really recommend adding more to it for the sake of doing so. The issue I have (and maybe it’s a layer 8 problem) is that mine does not. At least not as well as I want. If Sonarr ever did find anything on its own I never saw it, and while developing Fetcharr I definitely grabbed a few movies I was missing. It definitely seems like I’m not alone in this issue so I think it’ll be helpful for folks.

If you want, try it out and see if it does anything for you. If you think it’ll be helpful or a good replacement than great! If you find that you already have everything you need then that’s even better.

McWizard@lemmy.zip on 09 Mar 20:59 collapse

#Radarr curl -X POST “localhost:7878/api/v3/command”
-H “X-Api-Key: YOUR API KEY”
-H “Content-Type: application/json”
-d ‘{“name”: “missingMoviesSearch”}’

This does it for me. I have this in a cronjob together with one for sonarr and it starts the active search of the arrs.

egg82@lemmy.world on 10 Mar 00:29 collapse

That’s great! A cronjob can be effective if your indexer doesn’t mind the extra strain or you have a small library.

MalReynolds@slrpnk.net on 08 Mar 20:50 next collapse

I had a quick look, I think I could find a use for it but what I’d most be interested in is a dry run spitting out a list of missing / low res / low bitrate / stereo (I much prefer 5.1+), perhaps old codec, etc. media. Like many I have my own standards for what needs to be how good and so forth.

Ideally I could edit said list and put it back in as an active search list (perhaps chunking and prioritizing as well and iterating the process). Seems like this is 90% of the way there, any chance of an enhancement ?

Bit reluctant to just let someone else’s code go ham on my media library without a me in the loop step.

egg82@lemmy.world on 08 Mar 20:57 collapse

if you haven’t yet, I’d check out Configarr and the trash guides as a baseline to create profiles that upgrade media to a certain standard so simply hitting the search button will give you what you want. That’s likely the best option, though it could theoretically be done in Fetcharr itself.

I don’t want to balloon the project but I had an idea early on that people would want customization if I released it, so I thought about adding a sort-of “plugin” system where Fetcharr loads jar files from a directory and they get an API to access and use as needed.

I haven’t figured out the details yet. That’ll be another weekend, or a contribution from someone. The idea and skeleton is there, though.

Edit: missed the dry run part. That’s a great idea! The worst that can happen is that it triggers upgrades (there’s no code to modify anything) but it’s still a reasonable ask.

MalReynolds@slrpnk.net on 08 Mar 21:14 collapse

I don’t want to balloon the project

Fair cop, and no I haven’t really dived into Configarr and the trash guides (although I vaguely remember coming across them), oh joy, another rabbit hole. I do try to keep a simple stack, and what I have has served me well for years. But thanks, no need to reinvent the wheel if that handles my use case.

Having smaller projects with specific scope that do something well and can be plugged together is always preferable to some sprawling monstrosity. Used to be called the Unix way (pipe sed into awk etc.) and could stand to be revisited today. Best of luck.

egg82@lemmy.world on 08 Mar 21:18 collapse

glad to send someone on another Sunday rabbit hole! To be clear, Fetcharr is essentially automatically hitting the “search” button for you on a few semi-random items in your library. If your profiles are set up well, it will naturally handle the rest itself.

That said, there is a plan-in-my-head for “plugin” support so I don’t end up shoving a bunch of stuff into one app but still allow anyone to make something they need. If profiles don’t fit your use-case then that’ll be an option at some point in the future.

MalReynolds@slrpnk.net on 09 Mar 01:18 collapse

So, unless I didn’t dive deep enough, Configarr / Trash guides is mostly about setting up quality profiles and media paths and so forth, something I long ago sorted out to my satisfaction.

What I guess I was after was something to find stuff that has fallen through the cracks, highlighting stuff that doesn’t meet my standards and seeing whether I care enough to go looking for upgrades.

Strangely there doesn’t seem to be a simple app to run ffprobe over your library and populate a database for querying video quality, maybe I’ll get around to knocking one out one day, but today is not that day.

egg82@lemmy.world on 09 Mar 01:48 collapse

in Media Management (click Advanced) there’s an “Analyze Video Files” option to get more data about your actual files. If I remember correctly this also re-tags downloaded media with your profiles if it was mislabeled. If you already have quality profiles set up and gated (you can add profiles that look for these attributes, like 7.1 or 5.1) then you can simply hit the search button on your media and rely on the *arr app to do the rest. If you don’t want to upgrade stuff that’s already satisfactory to you then you can do the same thing with the “Cutoff Unmet” filter. Fetcharr allows you to do either of these with the new USE_CUTOFF environment variable.

If you’re looking for ffmpeg media analysis and health checks you can also check out something like tdarr.

MalReynolds@slrpnk.net on 09 Mar 02:27 collapse

Yeah, I have “Analyze Video Files” on, doesn’t get me a list of substandard files though, just sends the arr after stuff it’s probably already not finding.

tdarr

Hadn’t seen the Property search in here before, might get me most of the way there. Got it around somewhere, might have to spin it back up. Maybe I can raid it’s database as well. Thanks.

Colloidal@programming.dev on 08 Mar 20:54 next collapse

What happened to Huntarr?

egg82@lemmy.world on 08 Mar 20:59 next collapse

it was strangled to death by the maintainer (probably) after a breaking story on Reddit about its security flaws though since they disappeared from the internet nobody knows for sure what happened to it.

Colloidal@programming.dev on 08 Mar 21:08 collapse

Yowza. Thanks.

hesh@quokk.au on 08 Mar 21:14 next collapse

It was vibe-coded and exposed all of your API keys publically

JuvenoiaAgent@piefed.ca on 08 Mar 23:06 collapse

See https://www.reddit.com/r/selfhosted/comments/1rckopd/huntarr_your_passwords_and_your_entire_arr_stacks/

TLDR: huntarr was vibe-coded and had tons of security issues. When the “developer” was confronted, he nuked the git repo, his github account and all his social media accounts.

tuxiqae@lemmy.dbzer0.com on 09 Mar 00:08 next collapse

Interesting, thank you! You should consider using thr builtin Description GitHub provide for repos

egg82@lemmy.world on 09 Mar 00:30 collapse

good catch! I forgot that existed.

quick_snail@feddit.nl on 09 Mar 16:26 next collapse

Can you explain what is huntarr?

Sturgist@lemmy.ca on 09 Mar 17:05 next collapse

If I’m understanding the description on the git page correctly, it scans you media library, logs the resolution/format/etc, then searches wherever it’s pointed(torrent trackers?) for better quality versions.

minoche@lemmy.world on 09 Mar 20:19 collapse

It’s for movies and tv to “find and fetch missing or upgradable media.” Huntarr was the go-to app but it had security concerns and the maker’s responses were negatively received. In the last couple of weeks, some people have presented AI slop replacements for Huntarr.

GreenKnight23@lemmy.world on 09 Mar 21:49 collapse

people have presented AI slop

that has been happening in this community a lot recently.

andicraft@lemmy.blahaj.zone on 09 Mar 20:55 next collapse

I did still use ChatGPT

> “not vibecoded”

> looks inside

> vibecoded

egg82@lemmy.world on 10 Mar 00:27 collapse

Not sure what you mean by that. I occasionally use the web UI as the tool that it is and I’ve played around with opencode, cursor, etc previously on other home projects to get a sense for where things are and what the limits of these things are. That said, I take pride in my own work and this project is no exception. Is there something in this project that makes you think I threw a prompt into cursor and am passing that off as my own? Or are you against the idea of using an LLM and consider any person or project using them at all to be vibecoded?

As a quick edit, I’ll note that, since I documented any use of ChatGPT reasonably well in this project, you can see the number of times it was used and what it provided. I feel the contributions were largely inconsequential and really just time-saving on my end. I also vetted (and understood!) the output and modified it according to what I wanted. Personally, I don’t consider that to be “vibe-coding” but I suppose everyone has their own definition.

Edit again: ugh, it’s far too easy to focus on negative feedback and let that consume you. I am not going to defend my use of ChatGPT but I personally think that someone seeing the word ChatGPT and saying “oh so this is vibe-coded” is disingenuous to the project and my skills as a developer. I spent years learning and mastering Java and this is a lot of my experience and several weekends of my free time. Look, if you feel that the four uses of ChatGPT, much of which have been modified by my own hand and all of which inconsequential, constitutes a vibe-coded system then that’s your take - but I don’t think it’s a fair take. There are many things to be said about the ethics of modern LLMs and over-reliance on them but personally I think understanding and effectively using tools at your disposal is a skill. If you want something completely free of LLMs these days you may very well have to invent the universe.

Phew. Okay, I’m off my soap-box. Consider me got. I’ll try not to think about this too hard but it definitely feels bad pouring your time and skills into a thing and seeing that one comment saying “nah this isn’t worth anything”

andicraft@lemmy.blahaj.zone on 12 Mar 12:03 collapse

i’m pretty absolutionist on AI. i don’t use it in any capacity myself, and I do my best to avoid using software that was made with it. that’s not fully possible, unfortunately, but i prefer to put my support where my morals align.

you are clearly a competent programmer, so why are you giving ground to the plagiarism machine that’s killing the planet? can you not do the work without it?

it reminds me of a meme i saw recently. “we know child labor is bad, so in our new product, we only used a little child labor!”

egg82@lemmy.world on 12 Mar 15:31 collapse

Honestly, I should have figured these kinds of questions would come up around a project that is specifically designed to not use LLMs as much as possible. It’s a fair (and hard) series of questions, so here’s where I currently stand:

I don’t particularly like the profit-driven nature of the companies or NPOs behind the popular LLMs. Capitalism (Communism, Socialism, Anarchism, etc) in their purest forms are all terrible for different reasons, and you can see the issues with Capitalism reflected in the decisions these orgs make affecting their products and stakeholders.

I do, however, like the idea of an LLM as a secondary and more customized option to a search engine. There are questions I’ve had for years that weren’t easily Google-able but answered with <pick your favorite LLM> in a few seconds and easily verifiable with more conventional search techniques. Usually this is because I’m missing terminology or the current terminology is generic enough and the concept specific enough that any information is drowned in pages of results for other things. The vector-based nature of LLMs means you can get to specific concepts quite quickly.

They’re also pretty decent at stuff I am terrible at, like quick bits of math I would spend hours or days figuring out. This is a me problem, but my math skills are roughly around pre-college with patches of understanding around geometry and trig and I had to spend enormous effort getting just-barely-passing math grades for my degree. It’s not fun for me but it’s usually necessary for software development somewhere. A heavy-math portion is a good way to kill my motivation for a project. Similarly, there are languages which just aren’t fun and are repetitive and iterative in nature. Bash is a good example; I’m a Linux sysadmin and DevOps engineer by trade but a bash script with fancy flags and features just sucks to write. An LLM can do them easily and quickly and they’re easy enough to check, modify, and criticize.

There are also things LLMs are terrible at, and LLMs aren’t “excellent” at any particular thing. I’m remembering a clipped-to-death meme of someone in college where one professor says LLMs can’t do their particular subject very well but can do others fine. Another professor says the same thing shortly after, but for their particular subject. It highlights a problem tangentially related to the Dunning-Kruger effect with the same basis: people underestimate the depth of fields they do not understand. That said, LLMs can’t be trusted blindly and need to be verified. You can’t use an LLM to develop an understanding of a thing without a lot of learning on the side from more traditional media sources. You can, however, often use it to fill in gaps of understanding.

There are moral and ethical issues with current LLMs and because of those the acronyms LLM and AI are likely forever tainted- or at least for the next decade or so. The popular phrase “plagiarism machine” is a good example of that. The phrase is accurate enough and hits on an emotional level that’s easy to parrot and remember, and those kinds of things tend to stick around the collective subconscious long after the phrases themselves die.

One of the main issues today is over-reliance on LLMs for doing-your-work-for-you which is where vibe-coding comes in. Obviously it’s terrible for the reasons I explained above (a lack of understanding your own project and learning) but after trying it a bit myself I can see that it’s fun to do. I use opencode on home projects occasionally to keep on top of the understanding of these tools and to try out new things. It’s never directly saved me any time, but often it frees me up to do something else for a while and then I come back to a mostly-what-I-wanted thing that required minimal editing. I’ve never created a full project from start to finish with these tools, however; only to change bits of existing projects and fix issues. My plan was to try this out at some point but after using them for a while and seeing vibe-coded projects online I don’t think I need to in order to get a decent understanding of what will happen.

I can’t say for sure that the current generation and use of LLMs is “killing the planet” because there’s not enough research on it yet. There’s preliminary studies that largely point to “yes” but usually in strange and unexpected ways that could be solved. A few of those are refuted and all of them need reproducible results and peer-reviewing at the very least. So, I mean, yeah, it’s probably not wrong but, unfortunately, we just need to wait and see. There is, of course, the obvious dangers of wait-and-see, but these are difficult society-level issues and I don’t have any answers here. I’m not going to get hung up on problems I can’t solve.

LLMs in their current state remind me of 3D printers. I also

queasy@lemmy.world on 13 Mar 20:52 collapse

This is great, thank you!

egg82@lemmy.world on 13 Mar 20:59 collapse

glad to hear it! Thanks for checking it out.