RE: LeoThread 2024-12-30 17:37

You are viewing a single comment's thread:

#threadcast #CTTPodcast Ep 120 - How #AISummaries project uses AI to summarise Youtube videos using the Hive blockchain as the text storage and data feed and rewards system: https://x.com/3speaktv/status/1873857558642876703



0
0
0.000
120 comments
avatar

Thinking through this project - what if any aspect will storage of AI generated data have on the network, and when if ever would the network need additional storage with increased writes to the chain?

0
0
0.000
avatar

It's text... so not a real impact... text is very well compressed now.

0
0
0.000
avatar

The cost for a month of access to to this tool is a 5 $HBD subscription. Contact @mightpossibly for questions or more information.

0
0
0.000
avatar

Thanks for the info. That's pretty affordable.

0
0
0.000
avatar

@mightpossibly theorizing that increasing the quantity of high value on chain intrinsically increases the value of Hive.

0
0
0.000
avatar

Briefly how well does HIVE compare with other chains for storing large volumes of textual data?

0
0
0.000
avatar

I would say a lot... but I don't have any numbers... would need to have a look to show the differences.

0
0
0.000
avatar

I'm do not know how it compares to other chains, but what I can say is that HIVE is really good at it. I am not aware of any other chains that does this this well (or at all)

0
0
0.000
avatar

Quality data is a very important metric, but I would add as well, the fact that its then minable by any other tool without the need of extensive GPU processing needed to check a video that can also later on, be deleted.

But like @mightpossibly, there are many perspectives.

Either way, "text" is a very cheap resource on #hive, and if anyone wants to generate more data, they will consume more RC and that should bring more value in itself, to the chain, just alone on the using the chain.

0
0
0.000
avatar

You're right. That's a perfect system

0
0
0.000
avatar

Here is the value of the Hive blockchain:

The ability to provide up to date data....this is where social media apps excel.

Scrapping data for Ai training is one thing; another is to have the models updated outside training.

This is what vector databases tied to social media does.

0
0
0.000
avatar

Hive's major advantage is no direct transaction fees.

Most other chains have direct transaction fees that cause an issue for data storage.

0
0
0.000
avatar

hmm didn't realized that caused data storage. Why is that?

0
0
0.000
avatar

That's the best deal on the planet currently

0
0
0.000
avatar

@theycallmedan with the timely reminder that the Resource Credit system is a rational limit for writes to chain for most users.

0
0
0.000
avatar
(Edited)

My #hive node is around 507G (not counting history data that is generated from this data) of chain data, in 8 years? Nothing... any disk nowadays can store this, even NVMe's

😎

0
0
0.000
avatar

That's interesting to know! And my thoughts exactly. I was asking in hivedevs discord and they said that a full node with all historical data, 3 tb should be more than enough for the foreseeable future

0
0
0.000
avatar

Yeah, the historical data had a massive improvement with the HAF code. That's on my list for 2025. And it should fit all in a 4TB NVMe... which is crazy to think about!

0
0
0.000
avatar

Another point: each youtube url that is posted have the video viewing available on the Hive front end. Over time, this is adding an enormous amount of utility since there is content for people to look at.

AI will help by being able to build a recommend engine.

0
0
0.000
avatar

Here is how it looks:

0
0
0.000
avatar

Part 1/6:

Bitcoin Approaches $90,000 Amid Market Declines

As we approach the end of the year, Bitcoin has recently slipped closer to the $90,000 mark, showcasing a drop of about 2%, trading just shy of $92,000. Ethereum has also faced a decrease, with a loss of half a percent, sitting at approximately $3,333, while XP has plunged by 6.5% to $21. In the corporate sector, MicroStrategy's shares have fallen 6% after the company revealed another round of Bitcoin acquisitions totaling $29 million.

D. Kwon's Extradition to the U.S.

0
0
0.000
avatar

Part 2/6:

In a significant development, the former CEO of Terraform Labs, Do Kwon, is set to be extradited to the United States. The Montenegro Ministry of Justice recently approved the extradition, despite Kwon also facing charges in South Korea, where Terraform Labs was founded. He is primarily charged in relation to the collapse of the algorithmic stablecoin Terra USD, events that many attribute to triggering the 2022 crypto winter.

Interview with William Quigley on Stablecoin Growth

0
0
0.000
avatar

Part 3/6:

In today's segment of CBC's Crypto World, McKenzie Seallow interviewed William Quigley, co-founder of Wax and Tether. The discussion focused on the evolution of the stablecoin market and its potential growth in 2024 in light of new U.S. regulations expected in 2025. Quigley reflected on how stablecoins have played a vital role in crypto trading, facilitating as much as 60-70% of all cryptocurrency trading pairs. Despite their significant contribution, Quigley noted that the adoption of stablecoins for everyday consumer use—such as online shopping and peer-to-peer transactions—has been lackluster.

Institutional Use of Stablecoins

0
0
0.000
avatar

Part 4/6:

When questioned about institutional engagement with stablecoins, Quigley expressed that many non-crypto-focused companies still lack a comprehensive understanding of blockchain technology. He highlighted that while there has been an increased interest in crypto trading spurred by the approval of Bitcoin and Ethereum ETFs, there remains little exploration of stablecoins' utility beyond trading.

Quigley suggested that the potential cost savings provided by stablecoins, particularly for cross-border transactions, could fundamentally disrupt the existing global payment infrastructure. He mentioned that traditional financial institutions could be resistant to stablecoin adoption as it threatens their profit centers tied to currency conversion and transaction fees.

The Future of Stablecoins

0
0
0.000
avatar

Part 5/6:

Quigley is optimistic about the future of stablecoins, predicting that within the next decade, major global economies will adopt tokenized forms of their fiat currencies. He believes this will greatly enhance payment efficiencies, allowing for significant reductions in transaction costs. He emphasized that current regulatory discussions are crucial, and legislation around stablecoins should address the coexistence of privately issued and sovereign stablecoins.

There is a call for a clearer understanding among lawmakers regarding stablecoins' revolutionary potential in reducing friction in cross-border payments. Quigley underscored the immense annual savings in conversion costs that could be achieved if stablecoins gain broader acceptance and functionality across different industries.

0
0
0.000
avatar

Part 6/6:

Conclusion

In summary, the current landscape of the crypto market reveals volatility for major cryptocurrencies like Bitcoin and Ethereum, alongside pivotal developments such as D. Kwon's extradition. The interview with William Quigley highlights the critical role that stablecoins are playing in the current trading environment and their vast potential to reshape global payment systems in the future. With anticipated regulatory advancements on the horizon, the stage is set for a deeper integration of stablecoins into both institutional frameworks and everyday commerce.

0
0
0.000
avatar

Joel Valenzuela recently had an interview with Julian Chandra of Odysee, a project that decided to move to Arweave blockchain

The Arweave network is about Permanent information storage. like Bitcoin, but for data: A permanent and decentralized web inside an open ledger

We should perhaps look at what they are doing to see what Hive Blockchain's differentiator is.

0
0
0.000
avatar

The AISummaries are adding more text data being stored on Hive, the more resources LeoAI will draw from when it launches :)

0
0
0.000
avatar

That is true. It will be up to date based upon what was posted as opposed to being up to the last training date.

0
0
0.000
avatar

It's a good thing that you have such a feature that won't allow the same video to be summarised more than once

0
0
0.000
avatar

We were at about 600 posts (comments) in mid November. Here is where it is now.

0
0
0.000
avatar

Whoa! That's a huge growth in a little more than a month!

0
0
0.000
avatar

Doing around 5K threads per day with it.

If we get a few more people joining in, we can then increase it.

0
0
0.000
avatar

Impressive! Will try to check it out after the new year :D

0
0
0.000
avatar

And this is with just a handful of people. Imagine if we were 20, 50 or even 100 people summarizing videos

0
0
0.000
avatar

Yep, the gab seems quite yuge

0
0
0.000
avatar

Holy cow! Didn't expect to be so many. Is there a graph made by someone about who's requesting things... would be a good way to keep an eye about usage.

0
0
0.000
avatar

That problem of youtube deleting videos, could probably be solved by doing an evaluation of the video and checking for similar parts. But this would also be a challenge because people that use parts of others videos, could introduce false positives...

But a nice thing to have a look indeed. Having a tool like that for #hive would have a lot of value.

0
0
0.000
avatar

Would it be possible for AISummaries to do summaries for 3Speak videos too?

0
0
0.000
avatar

Yes eventually if he gets the time he will be able to build it.

0
0
0.000
avatar

@mightpossibly does currently see any issues with the likely data writes to the Hive Block chain when asked if the @spknetwork would conceivably be need to cover overspill during high utilization.

0
0
0.000
avatar

A good way to somehow decentivize this would be to expose the account using the service. At least reputation account would never do it...

Another way, would be, to require some sort of delegation in order to use the service. To avoid "ghost accounts" doing lots of request without any responsability.

0
0
0.000
avatar

right. Are we talking about duplicate summaries here? Both me (and the blockchain) is keeping track of all summaries, so accountablity/transparency for who summarizes what is already present. Should be even more transparent with the planned front-end and leaderboard.

I'd be interested to hear more about this delegation idea, not quite sure what you're referring to there

0
0
0.000
avatar

Should be even more transparent with the planned front-end and leaderboard.

Nice!

I'd be interested to hear more about this delegation idea, not quite sure what you're referring to there

Instead of making the service free which might attract scripting to do attacks, add a "very small" (suggestion) Hive delegation to an account that you can use for curation. This avoids decentivizes attacks from multiple accounts as they would need to power delegate hive in addition, to use the command.

Then you can even say, that if you delegate 5 HP, you can do 1 command per day, 10 HP, 2 per day, etc... (can be other ratios, just exemplifying).

Obviously this can come later as now raising awareness is probably more important, I am guessing.

0
0
0.000
avatar

Hm I don't think we discussed making it free? I feel like the subscription model is beneficial in several ways, as it both adds utility to HBD and makes it less tempting for spammers/attackers to misuse it. But I'm definitely open to it being things that I haven't thought about yet.

The case of subscribers using scripts to spam the service is somethign I have thought about, and would likely require me to implement some sort of upper limit of daily allowed requests. I'm intriguged by the idea of such a system, but I'm also a bit afraid a system like this would make it unneccessarily complex to use the service

0
0
0.000
avatar

Ups, ignore the free part. I was confusing something in my head.

But on the other hand, it would be important to think about users in general and how confortable they feel with using inleo to pay for a service. Hence why I was referring to the delegation option, which can be also in LEO...

But just to explain, the use of delegations could be an option for longer periods of time as an "accumulator" of tokens that then can be spent by the user at his own leasure.

This in my view is a more attractive way of payment because of its trustless nature. But, obviously spending HBD is more attractive as a financial model to support a service.

0
0
0.000
avatar

Ah yes! This is not a bad idea. You got my mind going. Probably not gonna do this just right now, but definitely gonna keep in in mind for the future - either for this or something else.

Regarding not using INLEO to pay for the service. How about offering an alternative solution: Simply transfer 5 HBD to @ai-summaries, no memos or anything? In addition to the existing option that is.

0
0
0.000
avatar

That could be an option too, but I would recon the value would decrease, because your account itself has "an intrinsic" value to subscribe for. So, it would have to be lower... if right now.

Otherwise in the long term when inleo is already large enough and your service too, then I would say it might not matter anymore. It all depends of what you make about the "subscription" value of a specific account.

0
0
0.000
avatar

Not sure I understand the part about decrease value and intrinsic value. Do I understand correctly that you support the idea to offer an additional way to subscribe (directly transfer funds to ai-summaries), but that 5HBD is too high price? I'm still on the fence on the best way to proceed, that's why I keep asking. Highly appreciate all your input!

0
0
0.000
avatar

What I mean with intrinsic value here, is the value you may offer/represent to other users by just things you post "privately" in your account and would require users to subscribe to you.

0
0
0.000
avatar

right. Are we talking about duplicate summaries here?

Nah, it was the fact that "idiots" might want to attack the chain by scanning for crap videos and flood the command. And how to decentivize doing that...

0
0
0.000
avatar

Right.. Yes indeed, how could we prevent that. It is possible to blacklist accounts of course, but this would quickly become difficult if the userbase increased. Do you think the current 5HBD fee is at least partially contributing to achieveing this?

0
0
0.000
avatar

5 HBD will be more than sufficient I would say. I would start lower to create some adoption and see when the "problem" starts appearing.

0
0
0.000
avatar

This will come into focus when LeoAI eventually rolls out.

The data is the first step to get it posted but it also requires utilizing it.

0
0
0.000
avatar

Ahh interesting we are getting some history of the development of @mightpossibly AI transcription system.

0
0
0.000
avatar

@mightpossibly just in case my tone or context was missed in the original question about network load, I asked because I don't a good mental model for it. My apologies if it sounded like I was implying something negative.

0
0
0.000
avatar

Its good to ask these questions =) people that might have the same worries :D

0
0
0.000
avatar

On the contrary. You're referring to this one, right? I may be missing the point of your question, but I do not see a problem with writing data to Hive. I do find it sensible to avoid summarizing the same content over and over again though. It was perhaps this you was referring to?

https://inleo.io/threads/view/alohaed/re-cttpodcast-q2lg9ham

By the way, I really liked how you kept the threadcast alive during the stream, reporting on what was being discussed as it progressed. Really useful in many ways - for those not listening, those who are half-listening, and also for those coming back reading /listening later after the stream ends

0
0
0.000
avatar

So it's already possible to do AIsummaries for 3Speak vids, it just needs some more functionality features from the 3Speak network. I suppose they will work on that soon😉

0
0
0.000
avatar

Good discussion - properly exchanging value for value - even though @mightpossibly is generating AI data - his intellectual work was needed to create and iterate this system?

@somecallmedan is this a good use-case for DHF funding?

0
0
0.000
avatar

Data storage moves to data utilizatoin.

Have to build ai agents on top to utilize said data.

0
0
0.000
avatar

That would massive! YouTube people do rely a lot on trying to use the "summaries" to create their video timelines. So this could actually create some demand if people find use for it.

0
0
0.000
avatar

Auto-summarising videos from a favorite video creator would be an amazing feature.

0
0
0.000
avatar

I agree! Initially it will probably be more like semi-auto, but yeah. Maybe a bit further down the road

0
0
0.000
avatar

think i am late for this. sounds interesting. will have to get back to it.

0
0
0.000
avatar

It might make sense to make an explicit funding post once a week , where people can can come in and upvote it. Presumably it would be supported by by curation services.

0
0
0.000
avatar

looks like it is close to the end. did you actually did a simple explanation for noobs what is going on here? :)

0
0
0.000
avatar

You listen to the initial call recorded on X...

0
0
0.000
avatar

will replay it, probably right away. was thinking is there a noob explanation i could maybe clip :D

0
0
0.000
avatar

Ai summarized timeline on a clip like this would allow you to quickly go into that part, even if you want to listen to it instead of read (like me, I prefer to listen because I can be doing something else or even driving).

Adding more usecases to the tool. FYI @mightpossibly

0
0
0.000
avatar

Would be nice to also be able to summarize more things like, this X recording!

Would already have a usecase for @bil.prag that wanted to understand what was talked about in the podcast.

And that brings podcasts more value actually.

0
0
0.000
avatar

He does summarize individual spaces.

0
0
0.000
avatar

People are creating tokens each day, they are just giving them to Elon and Zuck.

0
0
0.000
avatar

Enjoyed a lot this one... great potential on these SUPER active calls. Crazy to think about it, if it had like 1000x more people in these threads/podcasts...

It will happen!

0
0
0.000
avatar

All that information that is shared is stored somewhere, but now I wonder if there is enough space for all that.

0
0
0.000
avatar

Currently all gets recorded on the #hive blockchain as comments. But for sure something to keep an eye for. Just to avoid "useless" things.

0
0
0.000
avatar
(Edited)

Oh! I missed the show. Listening to it now. The AI summarizer is amazing, it summarizes YouTube videos easily and notifies us when we are out of subscription and also if the video was previously summarized.

I wish we could also have for 3speak videos, it will be awesome if this AI summarizer can do that as well.

0
0
0.000
avatar

@cttpodcast you need #premium to appear in the threadcast stuff... like this...

😎 - I think I have suggested this... but it might have not been me the originator! 😏

0
0
0.000