thickertoofan

joined 3 months ago
[–] thickertoofan@lemm.ee 0 points 3 days ago (1 children)

Not to be a dick, but everyone has something they feel every day, and relating that to your job is stupid. I know this is a meme, but people try to relate to it anyway, so I'm just thinking...

[–] thickertoofan@lemm.ee 7 points 1 week ago (1 children)

Ah, well, the good thing is that someone from piefed reached out to me, and I'm transferring my community there.

[–] thickertoofan@lemm.ee 3 points 1 week ago (4 children)

I know, right? lemmy.ml has the same UI, and I'm considering it as my next go-to.

[–] thickertoofan@lemm.ee 1 points 1 week ago (1 children)

how'd the migration work?

[–] thickertoofan@lemm.ee 17 points 1 week ago (6 children)

NOO! I loved this place.

[–] thickertoofan@lemm.ee 3 points 1 week ago

Good, amazing, but I'm not a Linux fanboy who will feel giddy about this. My friends would definitely press me over this. But yeah, I'm happy.

[–] thickertoofan@lemm.ee 2 points 1 week ago (1 children)

I've heard this a lot. How are modems black boxes?

[–] thickertoofan@lemm.ee 1 points 2 weeks ago (1 children)

Great point, man. People are downvoting you for nothing lol. Are they earth worshippers?

[–] thickertoofan@lemm.ee 2 points 4 weeks ago

It's a mix. I wouldn't play Minecraft with Fortnite-like graphics, and I love playing Vampire Survivors because it's a dopamine bomb. It really depends on the situation. But I'd play the GTA games for the open-world mechanics and graphics. You can't really make a great point out of this.

[–] thickertoofan@lemm.ee 1 points 2 months ago (1 children)

I've worked on this topic a lot. I did it once last year, and this year's work is the update above. Also, I just pushed a major update to the website for a cool thing: https://dcda-v2.vercel.app/ please check it out again!

Well, the thing is, I really don't have the motivation to work on this, because it requires a large community effort to gather a meaningful amount of data. And from an ML perspective, is it worth the effort? You'd have to take in the complexity of the Hindi language itself. Suppose I train the model to include the maatras: would it still be able to identify two characters side by side, conjoined by the line, with their maatras? If someone convinces me that this kind of dataset would have a LOT of value for the digitization of the language and its ecosystem, and that it would be extremely useful for future researchers, then sure, I'm down to work on it.

The implementation I'm thinking of is really easy, and we would not have to sit for hours writing samples on our own. We can distribute the task to the crowd: my idea for data collection is getting people in person to write a few letters on a piece of paper, then using CV to crop them out of the marked rectangles. I'm dumbing down the explanation, but yeah, it would require CV and markers. I can even collect data from the web app itself, but not many people would chip in. I'm not exceptionally famous, and I don't have a huge following that could get me thousands of inputs in a few days/weeks/months. With the network I have, it would maybe take years to get a meaningful variety of data, and I'm talking about just the base characters without maatras.

Sorry for the long rant, but yeah, I'm really not motivated to work on this, though I do have the idea/plan. I'd love to hand the torch to some newcomer or ML enthusiast, or to someone who's more into it than I am right now.
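
To make the "crop them out of the marked rectangles" idea a bit more concrete, here is a minimal sketch in plain Python. It assumes the corner markers have already been detected, the sheet is axis-aligned, and the letter boxes form a regular grid between the markers. The function names (`grid_cells`, `crop`) and the nested-list image format are made up for illustration; a real pipeline would likely use a CV library and handle rotation and perspective.

```python
def grid_cells(top_left, bottom_right, rows, cols):
    """Return (x0, y0, x1, y1) pixel boxes for a regular grid of cells.

    top_left / bottom_right are the (x, y) positions of two corner markers,
    assumed to be already detected on an axis-aligned scan (an assumption).
    Boxes are listed row by row.
    """
    x_min, y_min = top_left
    x_max, y_max = bottom_right
    cell_w = (x_max - x_min) / cols
    cell_h = (y_max - y_min) / rows
    boxes = []
    for r in range(rows):
        for c in range(cols):
            x0 = round(x_min + c * cell_w)
            y0 = round(y_min + r * cell_h)
            x1 = round(x_min + (c + 1) * cell_w)
            y1 = round(y_min + (r + 1) * cell_h)
            boxes.append((x0, y0, x1, y1))
    return boxes


def crop(img, box):
    """Cut one cell out of an image stored as a list of pixel rows."""
    x0, y0, x1, y1 = box
    return [row[x0:x1] for row in img[y0:y1]]
```

Each cropped cell would then be one handwriting sample, and the cell's position in the grid tells you which letter the writer was asked to write there.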

[–] thickertoofan@lemm.ee 1 points 2 months ago (3 children)

Thanks a lot! I think it's not only the joint letters; the diacritics are so diverse too, and it's a shame that we don't have any dataset covering this language and its diacritic combinations. Honestly, the possibilities are infinite and I don't know how we can generalize a model for this. It is surely possible, but I'm not that experienced in ML. I'd really like to get ideas on this. As for the dataset, I think I'm going to do something about a diacritics-included dataset in the future. I have plans but not the time to execute them fully, and the response and impact so far have been very small.

 

cross-posted from: https://lemm.ee/post/61282397

I'm open sourcing this project, which I made in just a weekend. I'm planning to continue it in my free time, with synthetic data generation and some more modifications. Anyone is welcome to chip in; I'm not an expert in ML. The inference is live here using TensorFlow.js. The model is just 1.92 megabytes!

 


 

Some custom filter kernel to average out values from a chunk of pixels with some kind of "border aware" behaviour?
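
One possible reading of "border aware", sketched below in plain Python: a box average that skips neighbours falling outside the image and also skips neighbours whose value differs too much from the centre pixel, so edges are not smeared. This is only an assumption about what is being asked for; the function name `border_aware_mean` and the `max_diff` threshold are made up for illustration, and the image is a grayscale list of rows.

```python
def border_aware_mean(img, radius=1, max_diff=30):
    """Average each pixel with its neighbours inside a square window.

    Neighbours are skipped when they fall outside the image or when they
    differ from the centre pixel by more than max_diff (a hypothetical
    threshold), so the average does not mix values across an edge.
    """
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            centre = img[y][x]
            total, count = 0.0, 0
            for dy in range(-radius, radius + 1):
                for dx in range(-radius, radius + 1):
                    ny, nx = y + dy, x + dx
                    if not (0 <= ny < h and 0 <= nx < w):
                        continue  # outside the image
                    if abs(img[ny][nx] - centre) > max_diff:
                        continue  # across an edge
                    total += img[ny][nx]
                    count += 1
            # count is at least 1 because the centre pixel always qualifies
            out[y][x] = total / count
    return out
```

Small noise gets averaged out, while a sharp step between two regions stays sharp. A bilateral filter is the smoother, weighted version of the same idea.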

 

I see this error when I'm trying to upload an icon image for a community I've recently created:

{"data":{"error":"pictrs_response_error","message":"Your account is too new to upload images"},"state":"success"}

I suppose that if the upload state was "success", and assuming the API output is correct, the image either got uploaded or got denied after the upload.
If that is the case, there's room for an improvement: we should do the permission check before the image upload happens, so we save bandwidth (I mean, it's negligible here, but I don't know if it happens in other places like image posts etc.).
That way we prevent useless upload/bandwidth usage (which I don't think happens in this case). If it doesn't happen, then the API has a bug: it gives a false status message. The bug is one of these two cases; I'm not sure which. Just discussing here before raising an enhancement issue on the GitHub repo.
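
To illustrate the ordering I mean, here is a minimal sketch in Python. It is not Lemmy's or pict-rs's actual code; `handle_upload`, `can_upload`, the 7-day threshold and the response fields are all hypothetical. The point is only that the account-age check runs before the request body is read, and that a denial is reported with an error state instead of "success".

```python
from datetime import datetime, timedelta, timezone

# Hypothetical threshold; the real rule for "too new" is not known here.
MIN_ACCOUNT_AGE = timedelta(days=7)


def can_upload(created_at, now, min_age=MIN_ACCOUNT_AGE):
    """Return True when the account is old enough to upload images."""
    return now - created_at >= min_age


def handle_upload(created_at, read_body, now=None):
    """Check permissions first, and read the image bytes only if allowed.

    read_body is a callable that pulls the uploaded bytes from the client;
    it is never called for a denied account, so no bandwidth is spent.
    """
    now = now or datetime.now(timezone.utc)
    if not can_upload(created_at, now):
        return {
            "state": "error",
            "error": "account_too_new",
            "message": "Your account is too new to upload images",
        }
    data = read_body()
    return {"state": "success", "bytes": len(data)}
```

With this shape, the status always matches the outcome: a denied account gets an error state and nothing is uploaded.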

 

Join if you want to have some geek discussions about it, or to ask for or provide help.

!flask@lemm.ee
