General Project Discussion / Re: Pattern based NLP
« Last post by infurl on March 06, 2021, 02:17:16 am »

The original lists are unsorted, so they are hashed and sorted in the program (hashed by summing ASCII codes). There are typically 0-5 duplicate hash ids/collisions, so the correct matches are also checked letter by letter.
...
After: 76ms of preparing. Hashing & sorting spelling and word list.
Pro-tip #2. There is no reason that you have to do preparation such as hashing and sorting at run-time. You could break out the portion of the code that does that preparation into a separate program which you run at build time. That program does all the necessary preparation and then prints out all the data structures in a format that can be included in your final program and compiled in place. That will save you a chunk of time every time you run the actual program.
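As a minimal sketch of that tip (the function and file names here are hypothetical, and the hash is the simple ASCII-sum scheme from the quoted post): a build-time script hashes and sorts the word list once, then emits a source module that the main program can import directly, with no preparation at run-time.

```python
# Hypothetical build-time preparation script.
# Run once at build time; the main program then imports the
# generated module instead of hashing and sorting on every run.

def ascii_hash(word):
    # Same scheme as the quoted post: sum of the ASCII codes.
    return sum(ord(c) for c in word)

def emit_prepared_module(words, out_path="prepared_words.py"):
    # Sort by (hash, word) so hash collisions sit next to each other
    # and can be resolved with a letter-by-letter comparison.
    table = sorted((ascii_hash(w), w) for w in set(words))
    with open(out_path, "w") as f:
        f.write("# Generated at build time -- do not edit by hand.\n")
        f.write("TABLE = %r\n" % (table,))
    return table
```

Anagrams such as "cat" and "act" collide under this hash, which is why the letter-by-letter check after the hash lookup is still needed.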
In my case I am parsing and processing millions of grammar rules, which can take a considerable amount of time just to prepare. Small grammars can be processed from start to finish at run-time, but I have found it much faster to compile the different files that make up the grammar into intermediate, partially processed files. These are then loaded and merged into a final grammar definition, which is saved both as source files that can be compiled and linked directly into my parser software, and as a database format that can be loaded as a binary file at run-time.
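A minimal sketch of that second output path, assuming nothing about infurl's actual file format: the prepared data is serialized to a binary file once, and the run-time program loads it directly instead of re-parsing and merging the grammar sources.

```python
# Hypothetical binary save/load step for prepared data.
# The "prepare once, load many times" split is the point; the
# serialization format here (pickle) is just a stand-in.
import pickle

def save_prepared(data, path):
    # Run at build time, after the expensive preparation.
    with open(path, "wb") as f:
        pickle.dump(data, f)

def load_prepared(path):
    # Run at start-up; a single binary read replaces the preparation.
    with open(path, "rb") as f:
        return pickle.load(f)
```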
That last feature has lots of advantages. The preprocessed source files were so large that it was taking a long time just to compile them, but best of all, by separating the data files from the software I can choose completely different processing options on the command line.