AutoGGUFEmbeddings

Companion object AutoGGUFEmbeddings

class AutoGGUFEmbeddings extends AnnotatorModel[AutoGGUFEmbeddings] with HasBatchedAnnotate[AutoGGUFEmbeddings] with HasEngine with HasLlamaCppModelProperties with HasProtectedParams

Annotator that uses the llama.cpp library to generate text embeddings with large language models.

The type of embedding pooling can be set with the setPoolingType method. The default is "MEAN". The available options are "MEAN", "CLS", and "LAST".

For all settable parameters, and their explanations, see HasLlamaCppModelProperties.

Pretrained models can be loaded with pretrained of the companion object:

val autoGGUFModel = AutoGGUFEmbeddings.pretrained()
  .setInputCols("document")
  .setOutputCol("embeddings")

The default model is "Qwen3_Embedding_0.6B_Q8_0_gguf", if no name is provided.

For available pretrained models please see the Models Hub.

For extended examples of usage, see the AutoGGUFEmbeddingsTest and the example notebook.

Note

To use GPU inference with this annotator, make sure to use the Spark NLP GPU package and set the number of GPU layers with the setNGpuLayers method.

When using larger models, we recommend adjusting GPU usage with setNCtx and setNGpuLayers according to your hardware to avoid out-of-memory errors.

Example

import com.johnsnowlabs.nlp.base._
import com.johnsnowlabs.nlp.annotator._
import org.apache.spark.ml.Pipeline
import spark.implicits._

val document = new DocumentAssembler().setInputCol("text").setOutputCol("document")

val autoGGUFModel = AutoGGUFEmbeddings
  .pretrained()
  .setInputCols("document")
  .setOutputCol("embeddings")
  .setBatchSize(4)
  .setPoolingType("MEAN")

val pipeline = new Pipeline().setStages(Array(document, autoGGUFModel))

val data = Seq(
  "The moons of Jupiter are 77 in total, with 79 confirmed natural satellites and 2 man-made ones.")
  .toDF("text")
val result = pipeline.fit(data).transform(data)
result.select("embeddings.embeddings").show(truncate = false)
+--------------------------------------------------------------------------------+
|                                                                      embeddings|
+--------------------------------------------------------------------------------+
|[[-0.034486726, 0.07770534, -0.15982522, -0.017873349, 0.013914132, 0.0365736...|
+--------------------------------------------------------------------------------+

Linear Supertypes

HasProtectedParams, HasLlamaCppModelProperties, HasEngine, HasBatchedAnnotate[AutoGGUFEmbeddings], AnnotatorModel[AutoGGUFEmbeddings], CanBeLazy, RawAnnotator[AutoGGUFEmbeddings], HasOutputAnnotationCol, HasInputAnnotationCols, HasOutputAnnotatorType, ParamsAndFeaturesWritable, HasFeatures, DefaultParamsWritable, MLWritable, Model[AutoGGUFEmbeddings], Transformer, PipelineStage, Logging, Params, Serializable, Serializable, Identifiable, AnyRef, Any

Ordering

Grouped
Alphabetic
By Inheritance

Inherited

AutoGGUFEmbeddings
HasProtectedParams
HasLlamaCppModelProperties
HasEngine
HasBatchedAnnotate
AnnotatorModel
CanBeLazy
RawAnnotator
HasOutputAnnotationCol
HasInputAnnotationCols
HasOutputAnnotatorType
ParamsAndFeaturesWritable
HasFeatures
DefaultParamsWritable
MLWritable
Model
Transformer
PipelineStage
Logging
Params
Serializable
Serializable
Identifiable
AnyRef
Any

Hide All
Show All

Visibility

Public
All

Instance Constructors

new AutoGGUFEmbeddings()
Annotator reference id.
Annotator reference id. Used to identify elements in metadata or to refer to this annotator type
new AutoGGUFEmbeddings(uid: String)
uid
required uid for storing annotator to disk

Type Members

implicit class ProtectedParam[T] extends Param[T]

Definition Classes
HasProtectedParams
type AnnotationContent = Seq[Row]
internal types to show Rows as a relevant StructType Should be deleted once Spark releases UserDefinedTypes to @developerAPI
internal types to show Rows as a relevant StructType Should be deleted once Spark releases UserDefinedTypes to @developerAPI

Attributes
protected
Definition Classes
AnnotatorModel
type AnnotatorType = String

Definition Classes
HasOutputAnnotatorType

Value Members

final def !=(arg0: Any): Boolean

Definition Classes
AnyRef → Any
final def ##(): Int

Definition Classes
AnyRef → Any
final def $[T](param: Param[T]): T

Attributes
protected
Definition Classes
Params
def $$[T](feature: StructFeature[T]): T

Attributes
protected
Definition Classes
HasFeatures
def $$[K, V](feature: MapFeature[K, V]): Map[K, V]

Attributes
protected
Definition Classes
HasFeatures
def $$[T](feature: SetFeature[T]): Set[T]

Attributes
protected
Definition Classes
HasFeatures
def $$[T](feature: ArrayFeature[T]): Array[T]

Attributes
protected
Definition Classes
HasFeatures
final def ==(arg0: Any): Boolean

Definition Classes
AnyRef → Any
def _transform(dataset: Dataset[_], recursivePipeline: Option[PipelineModel]): DataFrame

Attributes
protected
Definition Classes
AnnotatorModel
def afterAnnotate(dataset: DataFrame): DataFrame

Attributes
protected
Definition Classes
AnnotatorModel
final def asInstanceOf[T0]: T0

Definition Classes
Any
def batchAnnotate(batchedAnnotations: Seq[Array[Annotation]]): Seq[Seq[Annotation]]
Completes the batch of annotations.
Completes the batch of annotations.
batchedAnnotations
Annotations (single element arrays) in batches
returns
Completed text sequences

Definition Classes
AutoGGUFEmbeddings → HasBatchedAnnotate
def batchProcess(rows: Iterator[_]): Iterator[Row]

Definition Classes
HasBatchedAnnotate
val batchSize: IntParam
Size of every batch (Default depends on model).
Size of every batch (Default depends on model).

Definition Classes
HasBatchedAnnotate
def beforeAnnotate(dataset: Dataset[_]): Dataset[_]

Attributes
protected
Definition Classes
AnnotatorModel
val chatTemplate: Param[String]

Definition Classes
HasLlamaCppModelProperties
final def checkSchema(schema: StructType, inputAnnotatorType: String): Boolean

Attributes
protected
Definition Classes
HasInputAnnotationCols
final def clear(param: Param[_]): AutoGGUFEmbeddings.this.type

Definition Classes
Params
def clone(): AnyRef

Attributes
protected[lang]
Definition Classes
AnyRef
Annotations
@throws( ... ) @native()
def close(): Unit
Closes the llama.cpp model backend freeing resources.
Closes the llama.cpp model backend freeing resources. The model is reloaded when used again.
def copy(extra: ParamMap): AutoGGUFEmbeddings
requirement for annotators copies
requirement for annotators copies

Definition Classes
RawAnnotator → Model → Transformer → PipelineStage → Params
def copyValues[T <: Params](to: T, extra: ParamMap): T

Attributes
protected
Definition Classes
Params
final def defaultCopy[T <: Params](extra: ParamMap): T

Attributes
protected
Definition Classes
Params
val defragmentationThreshold: FloatParam

Definition Classes
HasLlamaCppModelProperties
val disableLog: BooleanParam

Definition Classes
HasLlamaCppModelProperties
val engine: Param[String]
This param is set internally once via loadSavedModel.
This param is set internally once via loadSavedModel. That's why there is no setter

Definition Classes
HasEngine
final def eq(arg0: AnyRef): Boolean

Definition Classes
AnyRef
def equals(arg0: Any): Boolean

Definition Classes
AnyRef → Any
def explainParam(param: Param[_]): String

Definition Classes
Params
def explainParams(): String

Definition Classes
Params
def extraValidate(structType: StructType): Boolean

Attributes
protected
Definition Classes
RawAnnotator
def extraValidateMsg: String
Override for additional custom schema checks
Override for additional custom schema checks

Attributes
protected
Definition Classes
RawAnnotator
final def extractParamMap(): ParamMap

Definition Classes
Params
final def extractParamMap(extra: ParamMap): ParamMap

Definition Classes
Params
val features: ArrayBuffer[Feature[_, _, _]]

Definition Classes
HasFeatures
def finalize(): Unit

Attributes
protected[lang]
Definition Classes
AnyRef
Annotations
@throws( classOf[java.lang.Throwable] )
val flashAttention: BooleanParam

Definition Classes
HasLlamaCppModelProperties
def get[T](feature: StructFeature[T]): Option[T]

Attributes
protected
Definition Classes
HasFeatures
def get[K, V](feature: MapFeature[K, V]): Option[Map[K, V]]

Attributes
protected
Definition Classes
HasFeatures
def get[T](feature: SetFeature[T]): Option[Set[T]]

Attributes
protected
Definition Classes
HasFeatures
def get[T](feature: ArrayFeature[T]): Option[Array[T]]

Attributes
protected
Definition Classes
HasFeatures
final def get[T](param: Param[T]): Option[T]

Definition Classes
Params
def getBatchSize: Int
Size of every batch.
Size of every batch.

Definition Classes
HasBatchedAnnotate
def getChatTemplate: String

Definition Classes
HasLlamaCppModelProperties
final def getClass(): Class[_]

Definition Classes
AnyRef → Any
Annotations
@native()
final def getDefault[T](param: Param[T]): Option[T]

Definition Classes
Params
def getDefragmentationThreshold: Float

Definition Classes
HasLlamaCppModelProperties
def getDisableLog: Boolean

Definition Classes
HasLlamaCppModelProperties
def getEngine: String

Definition Classes
HasEngine
def getFlashAttention: Boolean

Definition Classes
HasLlamaCppModelProperties
def getInputCols: Array[String]
returns
input annotations columns currently used

Definition Classes
HasInputAnnotationCols
def getLazyAnnotator: Boolean

Definition Classes
CanBeLazy
def getLogVerbosity: Int

Definition Classes
HasLlamaCppModelProperties
def getMainGpu: Int

Definition Classes
HasLlamaCppModelProperties
def getMetadata: String
Get the metadata for the model
Get the metadata for the model

Definition Classes
HasLlamaCppModelProperties
def getMetadataMap: Map[String, Map[String, String]]

Definition Classes
HasLlamaCppModelProperties
def getModelDraft: String

Definition Classes
HasLlamaCppModelProperties
def getModelIfNotSet: GGUFWrapper
def getModelParameters: ModelParameters

Attributes
protected
Definition Classes
HasLlamaCppModelProperties
def getNBatch: Int

Definition Classes
HasLlamaCppModelProperties
def getNCtx: Int

Definition Classes
HasLlamaCppModelProperties
def getNDraft: Int

Definition Classes
HasLlamaCppModelProperties
def getNGpuLayers: Int

Definition Classes
HasLlamaCppModelProperties
def getNGpuLayersDraft: Int

Definition Classes
HasLlamaCppModelProperties
def getNThreads: Int

Definition Classes
HasLlamaCppModelProperties
def getNThreadsBatch: Int

Definition Classes
HasLlamaCppModelProperties
def getNUbatch: Int

Definition Classes
HasLlamaCppModelProperties
def getNoKvOffload: Boolean

Definition Classes
HasLlamaCppModelProperties
def getNuma: String

Definition Classes
HasLlamaCppModelProperties
final def getOrDefault[T](param: Param[T]): T

Definition Classes
Params
final def getOutputCol: String
Gets annotation column name going to generate
Gets annotation column name going to generate

Definition Classes
HasOutputAnnotationCol
def getParam(paramName: String): Param[Any]

Definition Classes
Params
def getPoolingType: String
def getReasoningBudget: Int

Definition Classes
HasLlamaCppModelProperties
def getRopeFreqBase: Float

Definition Classes
HasLlamaCppModelProperties
def getRopeFreqScale: Float

Definition Classes
HasLlamaCppModelProperties
def getRopeScalingType: String

Definition Classes
HasLlamaCppModelProperties
def getSplitMode: String

Definition Classes
HasLlamaCppModelProperties
def getSystemPrompt: String

Definition Classes
HasLlamaCppModelProperties
def getUseMlock: Boolean

Definition Classes
HasLlamaCppModelProperties
def getUseMmap: Boolean

Definition Classes
HasLlamaCppModelProperties
def getYarnAttnFactor: Float

Definition Classes
HasLlamaCppModelProperties
def getYarnBetaFast: Float

Definition Classes
HasLlamaCppModelProperties
def getYarnBetaSlow: Float

Definition Classes
HasLlamaCppModelProperties
def getYarnExtFactor: Float

Definition Classes
HasLlamaCppModelProperties
def getYarnOrigCtx: Int

Definition Classes
HasLlamaCppModelProperties
val gpuSplitMode: Param[String]
Set how to split the model across GPUs
Set how to split the model across GPUs
- NONE: No GPU split
- LAYER: Split the model across GPUs by layer
- ROW: Split the model across GPUs by rows
Definition Classes
HasLlamaCppModelProperties
final def hasDefault[T](param: Param[T]): Boolean

Definition Classes
Params
def hasParam(paramName: String): Boolean

Definition Classes
Params
def hasParent: Boolean

Definition Classes
Model
def hashCode(): Int

Definition Classes
AnyRef → Any
Annotations
@native()
def initializeLogIfNecessary(isInterpreter: Boolean, silent: Boolean): Boolean

Attributes
protected
Definition Classes
Logging
def initializeLogIfNecessary(isInterpreter: Boolean): Unit

Attributes
protected
Definition Classes
Logging
val inputAnnotatorTypes: Array[AnnotatorType]
Annotator reference id.
Annotator reference id. Used to identify elements in metadata or to refer to this annotator type

Definition Classes
AutoGGUFEmbeddings → HasInputAnnotationCols
final val inputCols: StringArrayParam
columns that contain annotations necessary to run this annotator AnnotatorType is used both as input and output columns if not specified
columns that contain annotations necessary to run this annotator AnnotatorType is used both as input and output columns if not specified

Attributes
protected
Definition Classes
HasInputAnnotationCols
final def isDefined(param: Param[_]): Boolean

Definition Classes
Params
final def isInstanceOf[T0]: Boolean

Definition Classes
Any
final def isSet(param: Param[_]): Boolean

Definition Classes
Params
def isTraceEnabled(): Boolean

Attributes
protected
Definition Classes
Logging
val lazyAnnotator: BooleanParam

Definition Classes
CanBeLazy
def log: Logger

Attributes
protected
Definition Classes
Logging
def logDebug(msg: ⇒ String, throwable: Throwable): Unit

Attributes
protected
Definition Classes
Logging
def logDebug(msg: ⇒ String): Unit

Attributes
protected
Definition Classes
Logging
def logError(msg: ⇒ String, throwable: Throwable): Unit

Attributes
protected
Definition Classes
Logging
def logError(msg: ⇒ String): Unit

Attributes
protected
Definition Classes
Logging
def logInfo(msg: ⇒ String, throwable: Throwable): Unit

Attributes
protected
Definition Classes
Logging
def logInfo(msg: ⇒ String): Unit

Attributes
protected
Definition Classes
Logging
def logName: String

Attributes
protected
Definition Classes
Logging
def logTrace(msg: ⇒ String, throwable: Throwable): Unit

Attributes
protected
Definition Classes
Logging
def logTrace(msg: ⇒ String): Unit

Attributes
protected
Definition Classes
Logging
val logVerbosity: IntParam

Definition Classes
HasLlamaCppModelProperties
def logWarning(msg: ⇒ String, throwable: Throwable): Unit

Attributes
protected
Definition Classes
Logging
def logWarning(msg: ⇒ String): Unit

Attributes
protected
Definition Classes
Logging
val logger: Logger

Attributes
protected
Definition Classes
HasLlamaCppModelProperties
val mainGpu: IntParam

Definition Classes
HasLlamaCppModelProperties
val metadata: ProtectedParam[String]

Definition Classes
HasLlamaCppModelProperties
val modelDraft: Param[String]

Definition Classes
HasLlamaCppModelProperties
def msgHelper(schema: StructType): String

Attributes
protected
Definition Classes
HasInputAnnotationCols
val nBatch: IntParam

Definition Classes
HasLlamaCppModelProperties
val nCtx: IntParam

Definition Classes
HasLlamaCppModelProperties
val nDraft: IntParam

Definition Classes
HasLlamaCppModelProperties
val nGpuLayers: IntParam

Definition Classes
HasLlamaCppModelProperties
val nGpuLayersDraft: IntParam

Definition Classes
HasLlamaCppModelProperties
val nThreads: IntParam

Definition Classes
HasLlamaCppModelProperties
val nThreadsBatch: IntParam

Definition Classes
HasLlamaCppModelProperties
val nUbatch: IntParam

Definition Classes
HasLlamaCppModelProperties
final def ne(arg0: AnyRef): Boolean

Definition Classes
AnyRef
val noKvOffload: BooleanParam

Definition Classes
HasLlamaCppModelProperties
final def notify(): Unit

Definition Classes
AnyRef
Annotations
@native()
final def notifyAll(): Unit

Definition Classes
AnyRef
Annotations
@native()
val numaStrategy: Param[String]
Set optimization strategies that help on some NUMA systems (if available)
Set optimization strategies that help on some NUMA systems (if available)
Available Strategies:
- DISABLED: No NUMA optimizations
- DISTRIBUTE: Spread execution evenly over all
- ISOLATE: Only spawn threads on CPUs on the node that execution started on
- NUMA_CTL: Use the CPU map provided by numactl
- MIRROR: Mirrors the model across NUMA nodes
Definition Classes
HasLlamaCppModelProperties
def onWrite(path: String, spark: SparkSession): Unit

Definition Classes
AutoGGUFEmbeddings → ParamsAndFeaturesWritable
val optionalInputAnnotatorTypes: Array[String]

Definition Classes
HasInputAnnotationCols
val outputAnnotatorType: AnnotatorType

Definition Classes
AutoGGUFEmbeddings → HasOutputAnnotatorType
final val outputCol: Param[String]

Attributes
protected
Definition Classes
HasOutputAnnotationCol
lazy val params: Array[Param[_]]

Definition Classes
Params
var parent: Estimator[AutoGGUFEmbeddings]

Definition Classes
Model
val poolingType: Param[String]
Set the pooling type for embeddings, use model default if unspecified
Set the pooling type for embeddings, use model default if unspecified
- MEAN: Mean Pooling
- CLS: Choose the CLS token
- LAST: Choose the last token
val reasoningBudget: IntParam

Definition Classes
HasLlamaCppModelProperties
val ropeFreqBase: FloatParam

Definition Classes
HasLlamaCppModelProperties
val ropeFreqScale: FloatParam

Definition Classes
HasLlamaCppModelProperties
val ropeScalingType: Param[String]
Set the RoPE frequency scaling method, defaults to linear unless specified by the model.
Set the RoPE frequency scaling method, defaults to linear unless specified by the model.
- UNSPECIFIED: Don't use any scaling
- LINEAR: Linear scaling
- YARN: YaRN RoPE scaling
Definition Classes
HasLlamaCppModelProperties
def save(path: String): Unit

Definition Classes
MLWritable
Annotations
@Since( "1.6.0" ) @throws( ... )
def set[T](param: ProtectedParam[T], value: T): AutoGGUFEmbeddings.this.type
Sets the value for a protected Param.
Sets the value for a protected Param.
If the parameter was already set, it will not be set again. Default values do not count as a set value and can be overridden.
T
Type of the parameter
param
Protected parameter to set
value
Value for the parameter
returns
This object

Definition Classes
HasProtectedParams
def set[T](feature: StructFeature[T], value: T): AutoGGUFEmbeddings.this.type

Attributes
protected
Definition Classes
HasFeatures
def set[K, V](feature: MapFeature[K, V], value: Map[K, V]): AutoGGUFEmbeddings.this.type

Attributes
protected
Definition Classes
HasFeatures
def set[T](feature: SetFeature[T], value: Set[T]): AutoGGUFEmbeddings.this.type

Attributes
protected
Definition Classes
HasFeatures
def set[T](feature: ArrayFeature[T], value: Array[T]): AutoGGUFEmbeddings.this.type

Attributes
protected
Definition Classes
HasFeatures
final def set(paramPair: ParamPair[_]): AutoGGUFEmbeddings.this.type

Attributes
protected
Definition Classes
Params
final def set(param: String, value: Any): AutoGGUFEmbeddings.this.type

Attributes
protected
Definition Classes
Params
final def set[T](param: Param[T], value: T): AutoGGUFEmbeddings.this.type

Definition Classes
Params
def setBatchSize(size: Int): AutoGGUFEmbeddings.this.type
Size of every batch.
Size of every batch.

Definition Classes
HasBatchedAnnotate
def setChatTemplate(chatTemplate: String): AutoGGUFEmbeddings.this.type
The chat template to use
The chat template to use

Definition Classes
HasLlamaCppModelProperties
def setDefault[T](feature: StructFeature[T], value: () ⇒ T): AutoGGUFEmbeddings.this.type

Attributes
protected
Definition Classes
HasFeatures
def setDefault[K, V](feature: MapFeature[K, V], value: () ⇒ Map[K, V]): AutoGGUFEmbeddings.this.type

Attributes
protected
Definition Classes
HasFeatures
def setDefault[T](feature: SetFeature[T], value: () ⇒ Set[T]): AutoGGUFEmbeddings.this.type

Attributes
protected
Definition Classes
HasFeatures
def setDefault[T](feature: ArrayFeature[T], value: () ⇒ Array[T]): AutoGGUFEmbeddings.this.type

Attributes
protected
Definition Classes
HasFeatures
final def setDefault(paramPairs: ParamPair[_]*): AutoGGUFEmbeddings.this.type

Attributes
protected
Definition Classes
Params
final def setDefault[T](param: Param[T], value: T): AutoGGUFEmbeddings.this.type

Attributes
protected[org.apache.spark.ml]
Definition Classes
Params
def setDefragmentationThreshold(defragThold: Float): AutoGGUFEmbeddings.this.type
Set the KV cache defragmentation threshold
Set the KV cache defragmentation threshold

Definition Classes
HasLlamaCppModelProperties
def setDisableLog(disableLog: Boolean): AutoGGUFEmbeddings.this.type

Definition Classes
HasLlamaCppModelProperties
def setFlashAttention(flashAttention: Boolean): AutoGGUFEmbeddings.this.type
Whether to enable Flash Attention
Whether to enable Flash Attention

Definition Classes
HasLlamaCppModelProperties
def setGpuSplitMode(splitMode: String): AutoGGUFEmbeddings.this.type
Set how to split the model across GPUs
Set how to split the model across GPUs
- NONE: No GPU split -LAYER: Split the model across GPUs by layer 2. ROW: Split the model across GPUs by rows
Definition Classes
HasLlamaCppModelProperties
final def setInputCols(value: String*): AutoGGUFEmbeddings.this.type

Definition Classes
HasInputAnnotationCols
def setInputCols(value: Array[String]): AutoGGUFEmbeddings.this.type
Overrides required annotators column if different than default
Overrides required annotators column if different than default

Definition Classes
HasInputAnnotationCols
def setLazyAnnotator(value: Boolean): AutoGGUFEmbeddings.this.type

Definition Classes
CanBeLazy
def setLogVerbosity(logVerbosity: Int): AutoGGUFEmbeddings.this.type
Set the verbosity threshold.
Set the verbosity threshold. Messages with a higher verbosity will be ignored.
Values map to the following:
- GGML_LOG_LEVEL_NONE = 0
- GGML_LOG_LEVEL_DEBUG = 1
- GGML_LOG_LEVEL_INFO = 2
- GGML_LOG_LEVEL_WARN = 3
- GGML_LOG_LEVEL_ERROR = 4
- GGML_LOG_LEVEL_CONT = 5 (continue previous log)
Definition Classes
HasLlamaCppModelProperties
def setMainGpu(mainGpu: Int): AutoGGUFEmbeddings.this.type
Set the GPU that is used for scratch and small tensors
Set the GPU that is used for scratch and small tensors

Definition Classes
HasLlamaCppModelProperties
def setMetadata(metadata: String): AutoGGUFEmbeddings.this.type
Set the metadata for the model
Set the metadata for the model

Definition Classes
HasLlamaCppModelProperties
def setModelDraft(modelDraft: String): AutoGGUFEmbeddings.this.type
Set the draft model for speculative decoding
Set the draft model for speculative decoding

Definition Classes
HasLlamaCppModelProperties
def setModelIfNotSet(spark: SparkSession, wrapper: GGUFWrapper): AutoGGUFEmbeddings.this.type
def setNBatch(nBatch: Int): AutoGGUFEmbeddings.this.type
Set the logical batch size for prompt processing (must be >=32 to use BLAS)
Set the logical batch size for prompt processing (must be >=32 to use BLAS)

Definition Classes
HasLlamaCppModelProperties
def setNCtx(nCtx: Int): AutoGGUFEmbeddings.this.type
Set the size of the prompt context
Set the size of the prompt context

Definition Classes
HasLlamaCppModelProperties
def setNDraft(nDraft: Int): AutoGGUFEmbeddings.this.type
Set the number of tokens to draft for speculative decoding
Set the number of tokens to draft for speculative decoding

Definition Classes
HasLlamaCppModelProperties
def setNGpuLayers(nGpuLayers: Int): AutoGGUFEmbeddings.this.type
Set the number of layers to store in VRAM (-1 - use default)
Set the number of layers to store in VRAM (-1 - use default)

Definition Classes
HasLlamaCppModelProperties
def setNGpuLayersDraft(nGpuLayersDraft: Int): AutoGGUFEmbeddings.this.type
Set the number of layers to store in VRAM for the draft model (-1 - use default)
Set the number of layers to store in VRAM for the draft model (-1 - use default)

Definition Classes
HasLlamaCppModelProperties
def setNParallel(nParallel: Int): AutoGGUFEmbeddings.this.type
Sets the number of parallel processes for decoding.
Sets the number of parallel processes for decoding. This is an alias for setBatchSize.
nParallel
The number of parallel processes for decoding
def setNThreads(nThreads: Int): AutoGGUFEmbeddings.this.type
Set the number of threads to use during generation
Set the number of threads to use during generation

Definition Classes
HasLlamaCppModelProperties
def setNThreadsBatch(nThreadsBatch: Int): AutoGGUFEmbeddings.this.type
Set the number of threads to use during batch and prompt processing
Set the number of threads to use during batch and prompt processing

Definition Classes
HasLlamaCppModelProperties
def setNUbatch(nUbatch: Int): AutoGGUFEmbeddings.this.type
Set the physical batch size for prompt processing (must be >=32 to use BLAS)
Set the physical batch size for prompt processing (must be >=32 to use BLAS)

Definition Classes
HasLlamaCppModelProperties
def setNoKvOffload(noKvOffload: Boolean): AutoGGUFEmbeddings.this.type
Whether to disable KV offload
Whether to disable KV offload

Definition Classes
HasLlamaCppModelProperties
def setNumaStrategy(numa: String): AutoGGUFEmbeddings.this.type
Set optimization strategies that help on some NUMA systems (if available)
Set optimization strategies that help on some NUMA systems (if available)
Available Strategies:
- DISABLED: No NUMA optimizations
- DISTRIBUTE: spread execution evenly over all
- ISOLATE: only spawn threads on CPUs on the node that execution started on
- NUMA_CTL: use the CPU map provided by numactl
- MIRROR: Mirrors the model across NUMA nodes
Definition Classes
HasLlamaCppModelProperties
final def setOutputCol(value: String): AutoGGUFEmbeddings.this.type
Overrides annotation column name when transforming
Overrides annotation column name when transforming

Definition Classes
HasOutputAnnotationCol
def setParent(parent: Estimator[AutoGGUFEmbeddings]): AutoGGUFEmbeddings

Definition Classes
Model
def setPoolingType(poolingType: String): AutoGGUFEmbeddings.this.type
Set the pooling type for embeddings, use model default if unspecified.
Set the pooling type for embeddings, use model default if unspecified.
Possible values:
- MEAN: Mean pooling
- CLS: Choose the CLS token
- LAST: Choose the last token
- RANK: For reranking
def setReasoningBudget(reasoningBudget: Int): AutoGGUFEmbeddings.this.type
Controls the amount of thinking allowed; currently only one of: -1 for unrestricted thinking budget, or 0 to disable thinking (default: -1)
Controls the amount of thinking allowed; currently only one of: -1 for unrestricted thinking budget, or 0 to disable thinking (default: -1)

Definition Classes
HasLlamaCppModelProperties
def setRopeFreqBase(ropeFreqBase: Float): AutoGGUFEmbeddings.this.type
Set the RoPE base frequency, used by NTK-aware scaling
Set the RoPE base frequency, used by NTK-aware scaling

Definition Classes
HasLlamaCppModelProperties
def setRopeFreqScale(ropeFreqScale: Float): AutoGGUFEmbeddings.this.type
Set the RoPE frequency scaling factor, expands context by a factor of 1/N
Set the RoPE frequency scaling factor, expands context by a factor of 1/N

Definition Classes
HasLlamaCppModelProperties
def setRopeScalingType(ropeScalingType: String): AutoGGUFEmbeddings.this.type
Set the RoPE frequency scaling method, defaults to linear unless specified by the model.
Set the RoPE frequency scaling method, defaults to linear unless specified by the model.
- NONE: Don't use any scaling
- LINEAR: Linear scaling
- YARN: YaRN RoPE scaling
Definition Classes
HasLlamaCppModelProperties
def setSystemPrompt(systemPrompt: String): AutoGGUFEmbeddings.this.type
Set a system prompt to use
Set a system prompt to use

Definition Classes
HasLlamaCppModelProperties
def setUseMlock(useMlock: Boolean): AutoGGUFEmbeddings.this.type
Whether to force the system to keep model in RAM rather than swapping or compressing
Whether to force the system to keep model in RAM rather than swapping or compressing

Definition Classes
HasLlamaCppModelProperties
def setUseMmap(useMmap: Boolean): AutoGGUFEmbeddings.this.type
Whether to use memory-map model (faster load but may increase pageouts if not using mlock)
Whether to use memory-map model (faster load but may increase pageouts if not using mlock)

Definition Classes
HasLlamaCppModelProperties
def setYarnAttnFactor(yarnAttnFactor: Float): AutoGGUFEmbeddings.this.type
Set the YaRN scale sqrt(t) or attention magnitude
Set the YaRN scale sqrt(t) or attention magnitude

Definition Classes
HasLlamaCppModelProperties
def setYarnBetaFast(yarnBetaFast: Float): AutoGGUFEmbeddings.this.type
Set the YaRN low correction dim or beta
Set the YaRN low correction dim or beta

Definition Classes
HasLlamaCppModelProperties
def setYarnBetaSlow(yarnBetaSlow: Float): AutoGGUFEmbeddings.this.type
Set the YaRN high correction dim or alpha
Set the YaRN high correction dim or alpha

Definition Classes
HasLlamaCppModelProperties
def setYarnExtFactor(yarnExtFactor: Float): AutoGGUFEmbeddings.this.type
Set the YaRN extrapolation mix factor
Set the YaRN extrapolation mix factor

Definition Classes
HasLlamaCppModelProperties
def setYarnOrigCtx(yarnOrigCtx: Int): AutoGGUFEmbeddings.this.type
Set the YaRN original context size of model
Set the YaRN original context size of model

Definition Classes
HasLlamaCppModelProperties
final def synchronized[T0](arg0: ⇒ T0): T0

Definition Classes
AnyRef
val systemPrompt: Param[String]

Definition Classes
HasLlamaCppModelProperties
def toString(): String

Definition Classes
Identifiable → AnyRef → Any
final def transform(dataset: Dataset[_]): DataFrame
Given requirements are met, this applies ML transformation within a Pipeline or stand-alone Output annotation will be generated as a new column, previous annotations are still available separately metadata is built at schema level to record annotations structural information outside its content
Given requirements are met, this applies ML transformation within a Pipeline or stand-alone Output annotation will be generated as a new column, previous annotations are still available separately metadata is built at schema level to record annotations structural information outside its content
dataset
Dataset[Row]

Definition Classes
AnnotatorModel → Transformer
def transform(dataset: Dataset[_], paramMap: ParamMap): DataFrame

Definition Classes
Transformer
Annotations
@Since( "2.0.0" )
def transform(dataset: Dataset[_], firstParamPair: ParamPair[_], otherParamPairs: ParamPair[_]*): DataFrame

Definition Classes
Transformer
Annotations
@Since( "2.0.0" ) @varargs()
final def transformSchema(schema: StructType): StructType
requirement for pipeline transformation validation.
requirement for pipeline transformation validation. It is called on fit()

Definition Classes
RawAnnotator → PipelineStage
def transformSchema(schema: StructType, logging: Boolean): StructType

Attributes
protected
Definition Classes
PipelineStage
Annotations
@DeveloperApi()
val uid: String

Definition Classes
AutoGGUFEmbeddings → Identifiable
val useMlock: BooleanParam

Definition Classes
HasLlamaCppModelProperties
val useMmap: BooleanParam

Definition Classes
HasLlamaCppModelProperties
def validate(schema: StructType): Boolean
takes a Dataset and checks to see if all the required annotation types are present.
takes a Dataset and checks to see if all the required annotation types are present.
schema
to be validated
returns
True if all the required types are present, else false

Attributes
protected
Definition Classes
RawAnnotator
final def wait(): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long, arg1: Int): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long): Unit

Definition Classes
AnyRef
Annotations
@throws( ... ) @native()
def wrapColumnMetadata(col: Column): Column

Attributes
protected
Definition Classes
RawAnnotator
def write: MLWriter

Definition Classes
ParamsAndFeaturesWritable → DefaultParamsWritable → MLWritable
val yarnAttnFactor: FloatParam

Definition Classes
HasLlamaCppModelProperties
val yarnBetaFast: FloatParam

Definition Classes
HasLlamaCppModelProperties
val yarnBetaSlow: FloatParam

Definition Classes
HasLlamaCppModelProperties
val yarnExtFactor: FloatParam

Definition Classes
HasLlamaCppModelProperties
val yarnOrigCtx: IntParam

Definition Classes
HasLlamaCppModelProperties

Inherited from HasProtectedParams

Inherited from HasLlamaCppModelProperties

Inherited from HasEngine

Inherited from HasBatchedAnnotate[AutoGGUFEmbeddings]

Inherited from AnnotatorModel[AutoGGUFEmbeddings]

Inherited from CanBeLazy

Inherited from RawAnnotator[AutoGGUFEmbeddings]

Inherited from HasOutputAnnotationCol

Inherited from HasInputAnnotationCols

Inherited from HasOutputAnnotatorType

Inherited from ParamsAndFeaturesWritable

Inherited from HasFeatures

Inherited from DefaultParamsWritable

Inherited from MLWritable

Inherited from Model[AutoGGUFEmbeddings]

Inherited from Transformer

Inherited from PipelineStage

Inherited from Logging

Inherited from Params

Inherited from Serializable

Inherited from Identifiable

Inherited from AnyRef

Inherited from Any

Parameters

A list of (hyper-)parameter keys this annotator can take. Users can set and get the parameter values through setters and getters, respectively.

Packages

AutoGGUFEmbeddings

Companion object AutoGGUFEmbeddings

class AutoGGUFEmbeddings extends AnnotatorModel[AutoGGUFEmbeddings] with HasBatchedAnnotate[AutoGGUFEmbeddings] with HasEngine with HasLlamaCppModelProperties with HasProtectedParams

Note

Example

Instance Constructors

Type Members

Value Members

Inherited from HasProtectedParams

Inherited from HasLlamaCppModelProperties

Inherited from HasEngine

Inherited from HasBatchedAnnotate[AutoGGUFEmbeddings]

Inherited from AnnotatorModel[AutoGGUFEmbeddings]

Inherited from CanBeLazy

Inherited from RawAnnotator[AutoGGUFEmbeddings]

Inherited from HasOutputAnnotationCol

Inherited from HasInputAnnotationCols

Inherited from HasOutputAnnotatorType

Inherited from ParamsAndFeaturesWritable

Inherited from HasFeatures

Inherited from DefaultParamsWritable

Inherited from MLWritable

Inherited from Model[AutoGGUFEmbeddings]

Inherited from Transformer

Inherited from PipelineStage

Inherited from Logging

Inherited from Params

Inherited from Serializable

Inherited from Serializable

Inherited from Identifiable

Inherited from AnyRef

Inherited from Any

Parameters

Members

Parameter setters

Parameter getters

Packages

AutoGGUFEmbeddings 

Companion object AutoGGUFEmbeddings

class AutoGGUFEmbeddings extends AnnotatorModel[AutoGGUFEmbeddings] with HasBatchedAnnotate[AutoGGUFEmbeddings] with HasEngine with HasLlamaCppModelProperties with HasProtectedParams

Note

Example

Instance Constructors

Type Members

Value Members

Inherited from HasProtectedParams

Inherited from HasLlamaCppModelProperties

Inherited from HasEngine

Inherited from HasBatchedAnnotate[AutoGGUFEmbeddings]

Inherited from AnnotatorModel[AutoGGUFEmbeddings]

Inherited from CanBeLazy

Inherited from RawAnnotator[AutoGGUFEmbeddings]

Inherited from HasOutputAnnotationCol

Inherited from HasInputAnnotationCols

Inherited from HasOutputAnnotatorType

Inherited from ParamsAndFeaturesWritable

Inherited from HasFeatures

Inherited from DefaultParamsWritable

Inherited from MLWritable

Inherited from Model[AutoGGUFEmbeddings]

Inherited from Transformer

Inherited from PipelineStage

Inherited from Logging

Inherited from Params

Inherited from Serializable

Inherited from Serializable

Inherited from Identifiable

Inherited from AnyRef

Inherited from Any

Parameters

Members

Parameter setters

Parameter getters

AutoGGUFEmbeddings